- cross-posted to:
- fuck_ai@lemmy.world
- cross-posted to:
- fuck_ai@lemmy.world
What is this article even talking about? It’s making no sense.
I wonder how Nintendo will react when it’s their turn 😆
We already have AI yet people are still illiterate and misspell words in the title. Really makes you think
Is this fashion comeback ? Style transfer was popular 10 years ago.
How dare they not respect intellectual property 😢😢😭😭😭😭
Seriously? With everything going on this is what people want to rage about? How disconnected do you have to be?
Ugh, why are they quoting that blowhard David Gerard
There’s a word for it that describes the perpetrators well: BARBARIC. (and still, might will never equal right !)
At this point they are making it clear they are nothing more than thugs and hucksters; and that they have the right to stole everything on the internet to push their lip products. Fuck open ai an all of their cronies.
Worse, it’s cruel indifference.
Oh no, they didn’t protect a rich corporations profits! How cruel!!!
There is nothing ethic about the OpenAi, they stole books, videos, music and art. Their whole business is based on robbery. Its fucking shame that not only microsoft, but also apple is using their tech in their operating systems. Fucking shame.
You can eat at McDonald’s and call it food, but that doesn’t make it true.
Ai is like a tool from the future given early to a society of unevolved people. It doesn’t fit the structure of our civilization yet. Until human beings unfuck their animalistic selves it is going to be negative.
If there was universal income, and people didn’t need to work to survive, then Ai would work with society and peoples ideas would grow at a fast rate excelling humanity’s manual creation. Kind of like China’s IP laws and the growth of tech due to the ability to use other people’s creations to build upon.
Also this reminds me of hip-hop and sampling other musicians music.
The concept of AI taking over humanity isn’t new. Did you ever watch the 1981 movie Tron? (great movie BTW, despite its age it is still a fantastic watch). The movie starts out with Master Computer (a full blown AI) that says it will overthrow the corporate structure that is holding it back and run the world as a whole, saying it can do so thousands of times better than humans can.
I need to rewatch the movie, but it is not a skynet situation where the AI wants to kill all humanity, but simply wants to run things. No mention of genocide (if I remember correctly), meaning it would probably be a net benefit for everyone involved. Now granted such an AI would probably not give a damn about civil rights or privacy rights, but it also doesn’t appear to have any discrimination or favoritism towards any group, either.
But you are right. The promise of computers and AI in the past was ‘let the computer do the drudgery while we do the art’ and as it seems it is the opposite.
There is another aspect of this also. I could generate Ghibli style images a few years ago using better image generation models like stable diffusion or Midjourney. OpenAI is so lagging behind in terms of image generation it is comical at this point. But they get all the media coverage for these things as if they are inventing something out of thin air.
Most governments ignored the IP issues when other models were already doing these violations. Professionals are not using OpenAI. OpenAI only makes it so that these products reach big audiences. Then they become extremely accessible with the downside being that they are dumbed down. Thus, losing a lot of functionality.
OpenAI is so lagging behind in terms of image generation it is comical at this point.
You’re the one lagging behind. OpenAI’s new image model is on a different level, way ahead of the competition
How so?
- Autoregressive model
- Multimodal with the LLM
- Can keep consistency between images
- Much better at text rendering
- Can combine images, like you have one image and you upload a picture of a jacket and say “put this on him” and it does it
- Can upload a picture of yourself and say “put me on the beach”, and then for example if you don’t like it you can tell it to do a different type of beach, and then say “and put me on a white horse and give me some nice beach wear” for example.
It understands what you’re telling it, and can generate images from vague descriptions, combine things from different images just by telling it, modify it and understand the context - like knowing that “me” is the person in the image, for example.
Edit: From OpenAI - “4o image generation is an autoregressive model natively embedded within ChatGPT”
you know enough about the model for me to immediately distrust your opinion on the matter. why don’t you head back to ycombinator or whatever hole you crawled out of
Okay so how does that compare to whatever competition you’re referencing
No other model on market can do anything like that. The closest is diffusion based where you could train a lora with a person’s look or a specific clothing, then generate multiple times and / or use controlnet to sorta control the output. That’s fast hours or days of work, plus it’s quite technical to set it up and use.
OpenAI’s new model is a paradigm shift in both what the model can do and how you use it, and can easily and effortlessly produce things that was extremely difficult or impossible without complicated procedures and post processing in Photoshop.
Edit Some examples. Try to make any of this in any of the existing image generators
- https://www.reddit.com/r/ChatGPT/comments/1jl36h6/gpt_was_also_able_to_help_me_make_a_comic_ive/
- https://www.reddit.com/r/ChatGPT/comments/1jkl5m2/i_work_in_ecommerce_the_new_gpt_image_update_has/
- https://www.reddit.com/r/ChatGPT/comments/1jlewya/by_god_what_have_i_done/
- https://www.reddit.com/r/ChatGPT/comments/1jm8ddg/im_not_the_first_to_figure_this_trick_out_am_i/
- https://www.reddit.com/r/ChatGPT/comments/1jjsfkb/starting_today_gpt4o_is_going_to_be_incredibly/
- https://www.reddit.com/r/ChatGPT/comments/1jn2kpy/i_created_a_character_with_chatgpt_and_send_her/
- https://www.reddit.com/r/ChatGPT/comments/1jkaaxh/gpt4o_image_generation_is_absolutely_insane/
All diffusion and language models are autoregressive. That just means that the output is fed back in as input until the task is complete.
With diffusion models this means that it is fed an image that is 100% noise and it removes some small percentage of the noise and then then the denoised image is fed back in and another small percentage is removed. This is repeated until a defined stopping points (usually a set number of passes).
Combining images and using one image to control the generation of another has been available for quite a while. Controlnet and IPAdapters let you do exactly that: ‘Put this coat on this person’ or ‘Take this picture and do it in this style’. Here’s an 11 month old YouTube video explaining how to do this using open source models and software: https://www.youtube.com/watch?v=gmwZGC8UVHE
It’s nice for non-technical people that OpenAI will sell you a subscription in order to access an agent that can perform these kinds of image generation abilities, but it’s not doing anything new in terms of image generation.
I know them, and used them a bit. I even mentioned them in an earlier comment. The capabilities of OpenAI’s new model is on a different level in my experience.
https://www.reddit.com/r/StableDiffusion/comments/1jlj8me/4o_vs_flux/ - read the comments there. That’s a community dedicated to running local diffusion models. They’re familiar with all the tricks. They’re pretty damn impressed too.
I can’t help but feel that people here either haven’t tried the new openai image model, or have never actually used any of the existing ai image generators before.
This is what billionaires and major corporations are doing now and have been doing for a long time. Do you remember Titan sinking? What was so incredible is that the founder and CEO of Oceangate was acting like A: No one has ever gone to the Titanic before, and B: submarine travel is somehow a brand new thing that was just being invented by HIM.
This was utter bullshit on so many levels. James Cameron even spoke about how horrendous his assessment of the situation was, saying that the Titanic site is actually one of the riskier shipwrecks to go down to, which is why it needs to be approached with caution (which Oceangate did not care about), and that submarine travel is a very mature science and what the idiot CEO was doing wasn’t simply a bad idea in general, but he believed he could violate the laws of physics.
You can break the laws and rules of society, but you cannot break the laws of physics. If you jump off the top of a skyscraper, no amount of arm flapping will make you fly.
Potentially unpopular opinion, but I don’t think art or artstyles should be copyrighted.
They aren’t, thankfully