352

it will loose its ability to differentiate between there and their and its and it’s.

you are viewing a single comment's thread
view the rest of the comments
[-] driving_crooner@lemmy.eco.br 12 points 7 months ago

ChatGPT was already trained on Reddit data. Check this video to see how one reddit username caused bugs on it: https://youtu.be/WO2X3oZEJOA?si=maWhUpJRf0ZSF_1T

[-] TexasDrunk@lemmy.world 3 points 7 months ago

I'm not gonna watch, but I assume little Bobby Tables strikes again.

[-] driving_crooner@lemmy.eco.br 2 points 7 months ago

It's about the counting subreddit. It was used on the token generation database, but then removed on the training. This user posted so much on that subreddit that a token with its username was created, but then it had nothing associated with it in the training and the model dosen't know how to act when the token is present.

[-] PipedLinkBot@feddit.rocks 2 points 7 months ago

Here is an alternative Piped link(s):

https://piped.video/WO2X3oZEJOA?si=maWhUpJRf0ZSF_1T

Piped is a privacy-respecting open-source alternative frontend to YouTube.

I'm open-source; check me out at GitHub.

this post was submitted on 07 Mar 2024
352 points (93.8% liked)

Showerthoughts

29363 readers
535 users here now

A "Showerthought" is a simple term used to describe the thoughts that pop into your head while you're doing everyday things like taking a shower, driving, or just daydreaming. The best ones are thoughts that many people can relate to and they find something funny or interesting in regular stuff.

Rules

founded 1 year ago
MODERATORS