ChatGPT provides false information about people, and OpenAI can’t correct it

alb_004@lemm.ee · 10 months ago

ChatGPT provides false information about people, and OpenAI can’t correct it

givesomefucks@lemmy.world · 10 months ago

If scientists made AI, then it wouldn’t be an issue for AI to say “I don’t know”.

But capitalists are making it, and the last thing you want is it to tell an investor “I don’t know”. So you tell it to make up bullshit instead, and hope the investor believes it.

It’s a terrible fucking way to go about things, but this is America…

DarkThoughts@fedia.io · 10 months ago

This has nothing to do with scientists vs capitalists and everything with the fact that this is not actually “AI”. Someone called it T9 (word prediction) on steroids and I find that much more fitting with how those LLMs work. It just mimics the way humans talk, but it doesn’t actually converse intelligently or actually understands context - it just looks like it does, but only if you take it at face value and don’t look deeper into it.

givesomefucks@lemmy.world · 10 months ago

We don’t even know how real human intelligence/consciousness works…

Obviously modern chatbots aren’t true AI.

But people call a car with an automatic transmission “an automatic”, no one buts in to explain how the entire car isn’t automatic because you still have to use the steering wheel and pedals.

You’re just being too pedantic, and after a couple other people already were about the exact same thing homie…

DarkThoughts@fedia.io · 10 months ago

Why would I call the car automatic if it has an automatic transmission? Your analogy makes zero sense. Your username definitely does not check out either.Maybe stop being a salty snowflake after being called out for not having a clue of what you’re talking about.

givesomefucks@lemmy.world · 10 months ago

Why would I call the car automatic if it has an automatic transmission?

Because you speak English…

Have a nice life tho, anyone that unironically used “snowflake” and gets that defensive of being wrong is worth blocking.

DarkThoughts@fedia.io · 10 months ago

You make even less sense now. lol

anyone that unironically used “snowflake” and gets that defensive of being wrong is worth blocking.

I suppose you’re suggesting that I block you now, since you got so defensive of being wrong. But if I was that type of guy I would’ve already blocked you for being a literal hamas / terrorist apologist / sympathizer.

Meowing Thing@lemmy.world · 10 months ago

It is made by scientists. The problem is that said scientists are paid by investors mostly, or by grants that come from investors.

VeganCheesecake@lemmy.blahaj.zone · 10 months ago

Uh, I understand the sentiment, but the model doesn’t know anything. And it’s legit really hard to differentiate between factual things and random bullshit it made up.

👍Maximum Derek👍@discuss.tchncs.de · 10 months ago

Yeah, no one can make it say “I don’t know” because it is not really AI. Business bros decided to call it that and everyone smiled and nodded. LLMs are 1 small component (maybe) of AI. Maybe 1/80th of a true AI or AGI.

Honestly the most impressive part of LLMs is the tokenizer that breaks down the request, not the predictive text button masher that comes up with the response.

Kichae@lemmy.ca · 10 months ago

Honestly the most impressive part of LLMs is the tokenizer that breaks down the request, not the predictive text button masher that comes up with the response.

Yes, exactly! It’s ability to parse the input is incredible. It’s the thing that has that “wow” factor, and it feels downright magical.

Unfortunately, that also makes people intuitively trust its output.

givesomefucks@lemmy.world · 10 months ago

It “knows” as in it has access to the information and the ability to provide the right info for the right context.

Any part of that process the AI can just “bullshit” and fills in the gaps with random stuff.

Which is what you want when it’s “learning”. You want it to try so it’s attempt can be rated, and the relevant info added to its “knowledge”.

But when consumers are using it, you want it to say “I can’t answer that”. But consumers are usually stupid and will buy/use the one that says “I can’t answer that” the least.

And it’s legit really hard to differentiate between factual things and random bullshit it made up.

Which is why AI should tell end users “I don’t know” more often.

Kichae@lemmy.ca · 10 months ago

It “knows” as in it has access to the information and the ability to provide the right info for the right context.

It doesn’t, though, any more than you have access to the information in a pile of 10 million shredded documents.

givesomefucks@lemmy.world · 10 months ago

Right, in this case that we’re talking about…

Do you not understand how “answer unavailable” is a better answer than taking a small percent of strips of paper at random and filling in the rest with words that sound relevant?

It’s like a mad libs

Ech@lemm.ee · 10 months ago

taking a small percent of strips of paper at random and filling in the rest with words that sound relevant?

It’s like a mad libs

Right. They’re text generators. That’s the technology. It can’t do what you’re demanding because that’s not how it works. LLMs aren’t magic answer machines. They don’t know when to say “answer not available”. They don’t know what they’re being asked. They don’t know anything.

wahming@monyet.cc · 10 months ago

That is what LLMs do in EVERY conversation. Most of the time you don’t notice it, because it fits your expectations.

DudeDudenson@lemmings.world · 10 months ago

Was gonna say, the AI doesn’t make up or admit bullshit, its just a very advanced a prediction algorithm. It responds with what the combination of words that is most likely the expected answer.

Wether that is accurate or not is part of training it but you’ll never get 100% accuracy to any query

maynarkh@feddit.nl · 10 months ago

If it can name what the most likely combination is, couldn’t it also know how likely that combination of words is?

wahming@monyet.cc · 10 months ago

No, because that requires it to understand the words. It doesn’t.

DudeDudenson@lemmings.world · edit-2 10 months ago

It’s not actually deciding anything, the AI thinking is marketing fluff really. But yes that’s called confidence rating and it does. But at the scale of something like chatgpt that uses a snapshot of the entire internet and is non mutable there’s no way to train it for every possible question. If you ask about a topic 99% of the internet gets wrong it’ll respond the wrong thing with 99% confidence

kent_eh@lemmy.ca · 10 months ago

If it has been trained using questionable sources, or if it’s training data includes sarcastic responses (without understanding that context), it isn’t hard to imagine how confidently wrong some of the responses could be.

howrar@lemmy.ca · 10 months ago

It is made by scientists. And we don’t know how to make the model determine whether or not it knows something. So far, we only have tools that tell us that something probably wasn’t in the training set (e.g. using variance across models in a mixture of experts setup), but that doesn’t tell us anything about how correct it is.

set_secret@lemmy.world · 10 months ago

Just put this into GPT 4.

What’s your view of the fizbang Raspberry blasters?

Gpt ‘I’m not familiar with “fizbang Raspberry blasters.” Could you provide more details or clarify what they are?’

It’s a drink making machine from china

Gpt ‘I don’t have any specific information on the “fizbang Raspberry blasters” drink making machine. If it’s a new or niche product, details might be limited online.’

So, in this instance is didn’t hallucinate, i tried a few more made up things and it’s consistent in saying it doesn’t know of these.

Explanations?

expr@programming.dev · 10 months ago

It’s got nothing to do with capitalism. It’s fundamentally a matter of people using it for things it’s not actually good at, because ultimately it’s just statistics. The words generated are based on a probability distribution derived from its (huge) training dataset. It has no understanding or knowledge. It’s mimicry.

It’s why it’s incredibly stupid to try using it for the things people are trying to use it for, like as a source of information. It’s a model of language, yet people act like it has actual insight or understanding.

givesomefucks@lemmy.world · 10 months ago

Imagine searching your computer for a PDF named “W2.2026”…

Would you rather the computer tell you it’s not in the database? Or would you prefer a random PDF displayed with the title “W2.2026”?

This isn’t a new problem.

You’re getting hung up on “know” instead “has relevant information in it’s database and can access it”.

But besides all that and the other things you got wrong:

It’s still about capitalism for the reasons I just said

Zarxrax@lemmy.world · 10 months ago

You don’t seem to understand. There is no database.

givesomefucks@lemmy.world · 10 months ago

https://www.merriam-webster.com/dictionary/analogy

Æther@lemmy.world · 10 months ago

https://en.wikipedia.org/wiki/False_equivalence

expr@programming.dev · 10 months ago

You do not understand how these things actually work. I mean, fair enough, most people don’t. But it’s a bit foolhardy to propose changes to how something works without understanding how it works now.

There is no “database”. That’s a fundamental misunderstanding of the technology. It is entirely impossible to query a model to determine if something is “present” or not (the question doesn’t even make sense in that context).

A model is, to greatly simplify things, a function (like in math) that will compute a response based on the input given. What this computation does is entirely opaque (including to the creators). It’s what we we call a “black box”. In order to create said function, we start from a completely random mapping of inputs to outputs (we’ll call them weights from now on) as well as training data, iteratively feed training data to this function and measure how close its output is to what we expect, adjusting the weights (which are just numbers) based on how close it is. This is a gross simplification of the complexity involved (and doesn’t even touch on the structure of the model’s network itself), but it should give you a good idea.

It’s applied statistics: we’re effectively creating a probability distribution over natural language itself, where we predict the next word based on how frequently we’ve seen words in a particular arrangement. This is old technology (dates back to the 90s) that has hit the mainstream due to increases in computing power (training models is very computationally expensive) and massive increases in the size of dataset used in training.

Source: senior software engineer with a computer science degree and multiple graduate-level courses on natural language processing and deep learning

Btw, I have serious issues with both capitalism itself and machine learning as it is applied by corporations, so don’t take what I’m saying to mean that I’m in any way an apologist for them. But it’s important to direct our criticisms of the system as precisely as possible.

wahming@monyet.cc · 10 months ago

It’s not a database. God, how many years is it going to take before people understand just what LLMs are and are not capable of?