We have to stop ignoring AI’s hallucination problem

misk@sopuli.xyz · 6 months ago

We have to stop ignoring AI’s hallucination problem

FalseMyrmidon@kbin.run · 6 months ago

Who’s ignoring hallucinations? It gets brought up in basically every conversation about LLMs.

Neato@ttrpg.network · 6 months ago

It really needs to be a disqualifying factor for generative AI. Even using it for my hobbies is useless when I can’t trust it knows dick about fuck. Every time I test the new version out it gets things so blatantly wrong and contradictory that I give up; it’s not worth the effort. It’s no surprise everywhere I’ve worked has outright banned its use for official work.

DdCno1@kbin.social · 6 months ago

I agree. The only application that is fine for this in my opinion is using it solely for entertainment, as a toy.

The problem is of course that everyone and their mothers are pouring billions into what clearly should only be used as a toy, expecting it to perform miracles it currently can not and might never be able to pull off.

nyan@lemmy.cafe · 6 months ago

The part that’s being ignored is that it’s a problem, not the existence of the hallucinations themselves. Currently a lot of enthusiasts are just brushing it off with the equivalent of ~~boys will be boys~~ AIs will be AIs, which is fine until an AI, say, gets someone jailed by providing garbage caselaw citations.

And, um, you’re greatly overestimating what someone like my technophobic mother knows about AI ( xkcd 2501: Average Familiarity seems apropos). There are a lot of people out there who never get into a conversation about LLMs.

14th_cylon@lemm.ee · 6 months ago

People who suggest, let’s say, firing employees of crisis intervention hotline and replacing them with llms…

Voroxpete@sh.itjust.works · 6 months ago

Less horrifying conceptually, but in Canada a major airline tried to replace their support services with a chatbot. The chatbot then invented discounts that didn’t actually exist, and the courts ruled that the airline had to honour them. The chatbot was, for all intents and purposes, no more or less official a source of data than any other information they put out, such as their website and other documentation.

SkyezOpen@lemmy.world · 6 months ago

“Have you considered doing a flip as you leap off the building? That way your death is super memorable and cool, even if your life wasn’t.”

-Crisis hotline LLM, probably.