150

Today's Large Language Models are Essentially BS Machines (quandyfactory.com)

submitted 1 year ago by Veraticus@lib.lgbt to c/technology@beehaw.org

134 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[-] Veraticus@lib.lgbt 7 points 1 year ago

They don't generate facts, as the article says. They choose the next most likely word. Everything is confidently plausible bullshit. That some of it is also true is just luck.

[-] kogasa@programming.dev 4 points 1 year ago* (last edited 1 year ago)

It's obviously not "just" luck. We know LLMs learn a variety of semantic models of varying degrees of correctness. It's just that no individual (inner) model is really that great, and most of them are bad. LLMs aren't reliable or predictable (enough) to constitute a human-trustable source of information, but they're not pure gibberish generators.

[-] Veraticus@lib.lgbt 2 points 1 year ago

No, it's true, "luck" might be overstating it. There's a good chance most of what it says is as accurate as the corpus it was trained on. That doesn't personally make me very confident, but ymmv.

[-] Zaktor@sopuli.xyz 2 points 1 year ago

That's just not true. Semantic encodings work. It's not like neural networks are some new untested concept, the LLMs have some new tricks under the hood and are way more extensive in their training goal, but they're fundamentally the same thing. All neural networks are mimicry machines enabled and limited by their data, but mimicking largely correct data produces largely correct results when the answer, or interpolatable answers exists in the training data. The problem arises when asked to go further and further afield from their inputs. Some interpolation and substitutions work, but it gets increasingly unreliable the more niche the answer is.

While the LLM hype has very seriously oversold their abilities, the instinctive backlash to say they're useless is similarly way off-base.

[-] Veraticus@lib.lgbt 2 points 1 year ago

No one is saying "they're useless." But they are indeed bullshit machines, for the reasons the author (and you yourself) acknowledged. Their purposes is to choose likely words. That likely and correct are frequently the same shouldn't blind us to the fact that correctness is a coincidence.

[-] Zaktor@sopuli.xyz 0 points 1 year ago

That likely and correct are frequently the same shouldn’t blind us to the fact that correctness is a coincidence.

That's an absurd statement. Do you have any experience with machine learning?

[-] Veraticus@lib.lgbt 1 points 1 year ago

It isn't; I do; do you?

[-] Zaktor@sopuli.xyz 1 points 1 year ago

Yes, it's been my career for the last two decades and before that was the focus of my education. The idea that "correctness is a coincidence" is absurd and either fails to understand how training works or rejects the entire premise of large data revealing functional relationships in the underlying processes.

[-] Veraticus@lib.lgbt 1 points 1 year ago

Or you've simply misunderstood what I've said despite your two decades of experience and education.

If you train a model on a bad dataset, will it give you correct data?

If you ask a question a model it doesn't have enough data to be confident about an answer, will it still confidently give you a correct answer?

And, more importantly, is it trained to offer CORRECT data, or is it trained to return words regardless of whether or not that data is correct?

I mean, it's like you haven't even thought about this.

this post was submitted on 12 Sep 2023

150 points (100.0% liked)

Technology

37809 readers

244 users here now

A nice place to discuss rumors, happenings, innovations, and challenges in the technology sphere. We also welcome discussions on the intersections of technology and society. If it’s technological news or discussion of technology, it probably belongs here.

Remember the overriding ethos on Beehaw: Be(e) Nice. Each user you encounter here is a person, and should be treated with kindness (even if they’re wrong, or use a Linux distro you don’t like). Personal attacks will not be tolerated.

Subcommunities on Beehaw:

This community's icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.

founded 2 years ago

MODERATORS

alyaza@beehaw.org

TheRtRevKaiser@beehaw.org

gyrfalcon@beehaw.org

rs5th@beehaw.org

coldredlight@beehaw.org

Los@beehaw.org

SemioticStandard@beehaw.org

TheRtRevKaiser@kbin.social

remington@beehaw.org