Sex offender banned from using AI tools in landmark UK case

girlfreddy@lemmy.ca · 10 months ago

Sex offender banned from using AI tools in landmark UK case

Tippon@lemmy.dbzer0.com · 10 months ago

Where does the training data come from to create indecent images of children?

Turun@feddit.de · 10 months ago

AI is able to fill in the last field in a table like “Old / young” vs “Clothed / naked” when given three of the four fields.

randomaside@lemmy.dbzer0.com · 10 months ago

Please reiterate your statement but instead using the “goose chase meme” format.

Dran@lemmy.world · edit-2 10 months ago

It doesn’t need csam data for training, it just needs to know what a boob looks like, and what a child looks like. I run some sdxl-based models at home and I’ve observed it can be difficult to avoid more often than you’d think. There are keywords in porn that blend the lines across datasets (“teen”, “petite”, “young”, “small” etc). The word “girl” in particular I’ve found that if you add that to basically any porn prompt gives you a small chance of inadvertently creating the undesirable. You have to be really careful and use words like “woman”, “adult”, etc instead to convince your image model not to make things that look like children. If you’ve ever wondered why internet-based porn generators are on super heavy guardrails, this is why.

Tippon@lemmy.dbzer0.com · 10 months ago

Thanks for the reply, it’s given me a good idea of what’s most likely happening :)

It’s a shame that the rest of the thread went to shit, but unfortunately it’s an emotional topic, and brings out emotional responses

Dran@lemmy.world · 10 months ago

Always happy to try and productively add to someone’s learning.

xmunk@sh.itjust.works · 10 months ago

It is true, a 10 year old naked woman is just a 30 year old naked woman scaled down by 40%. /s

No buddy, there isn’t some vector of “this is the distance between kid and adult” that a model can apply to generate what a hypothetical child looks like. The base model was almost certainly trained on more than just anatomical drawings from Wikipedia - it ate some csam.

If you’ve seen stuff about “Hitler - Germany + Italy = Mussolini”: for models where that’s true (which is not universal), it takes an awful lot of training data to establish and strengthen those vectors. Unless the generated images were comically inaccurate, a lot of training went into this too.

redlue@startrek.website · edit-2 10 months ago

Removed by mod

rebelsimile@sh.itjust.works · 10 months ago

Right, and the Google image AI gobbled up a bunch of images of black George Washington, right? They must have been in the data set, there’s no way to blend a vector from one value to another, like you said. That would be madness. Nope, must have been copious amounts of Asian Nazis in the training set, since the model is incapable of blending concepts.

xmunk@sh.itjust.works · 10 months ago

You’re incorrect and you should fucking know better.

I have no idea why my comment above was downvoted to hell but AI can’t “dream up” what a naked young person looks like. An AI can figure that adults wear different clothes and put a black woman in a revolutionary war outfit. These are totally different concepts.

You can downvote me if you like but your AI generated csam is based on real csam, so fuck off. I’m disappointed there is such a large proportion of people defending csam here, especially since Lemmy should be technically oriented - I expect to see more input from fellow AI-fluent people.

rebelsimile@sh.itjust.works · 10 months ago

You’re spreading misinformation and getting called out for it.

xmunk@sh.itjust.works · 10 months ago

It isn’t misinformation, though; generative AI needs a basis for its generation.

rebelsimile@sh.itjust.works · edit-2 10 months ago

The misinformation you’re spreading is related to how it works. A generative AI system will (without prompting away from it) create people with 3 heads, 8 fingers on each hand and multiple legs connecting to each other. Do you think it was trained on that? This argument of “it can generate it, therefore it was trained on it” is ridiculous. You clearly don’t understand how it works.

xmunk@sh.itjust.works · 10 months ago

Just a note - csam has been found in model training sets: https://cyber.fsi.stanford.edu/news/investigation-finds-ai-image-generation-models-trained-child-abuse

rebelsimile@sh.itjust.works · 10 months ago

Ok? Hundreds of images of anything aren’t necessarily going to train a model based on billions of images. Have you ever tried to get Stable Diffusion to draw a bow and arrow? Just because it has seen something doesn’t mean that it has learned it, nor, more importantly, does it mean that is the way it learned it, since we can see that it can infer many concepts from related concepts: pregnant old women, Asian Nazis, black George Washingtons (NONE OF WHICH actually have ever existed or been photographed)… is unclothed children really more of a leap than any of those?

xmunk@sh.itjust.works · 10 months ago

It is, yes. A black George Washington is one known visual motif (a George Washington costume) combined with another known visual motif. A naked prepubescent child isn’t just the combination of “naked adult” and “child”; naked children don’t look like naked adults simply scaled down.

AI can’t tell us what something we’ve never seen looks like… a kid who knows what George Washington and a black woman look like can imagine a black George Washington. That’s probably a helpful analogy: AI can combine simple concepts but it can’t innovate - it can dream, but it can’t know something that we haven’t told it about.