AI (LLMs and image generators, really) is trained on human-made material scraped from the entire internet. As more and more communities and sites online are filled with AI-generated content, AI training is in danger of getting stuck in a loop, trained on its own output again and again.
Communities that are harshly purist in their anti-AI rules are thus an excellent source of curated training data for AI while the rest of the internet becomes unusable for the task.
Perhaps AI companies and their products are even deliberately so annoying, shitty and repulsive, because they want to spark some resistance, have a part of the population reject AI and enforce anti-AI rules and communities.


Rule 2 is a little gray but if the title reads like a catchy headline, then it’s probably not a complete thought. Another way to think about it, the title should make you want to read more because it’s a good thought, not because it a cliff hanger.
OP, would you mind updating the title with a little more context?
For example if you added “because they are providing training data” to the title. It would seem more like a complete thought to me.
Nope. Entire thought needs to be in headline
Changed it, a bit long now but I guess it’s ok?