Supernova1051@sh.itjust.works 2 points 6 days ago

With your first sentence, I can say you’re wrong.

Except I'm not wrong. The model they ran is roughly 4 orders of magnitude smaller than even the smallest "mini" models that are generally available. Compare it against TinyLlama 1.1B [1] or Phi-3 mini 3.8B [2]. Most "mini" models range from about 1 to 10 billion parameters, which makes running them incredibly inefficient on older devices.
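
For a rough sense of scale, here is a minimal Python sketch of that arithmetic. The parameter counts are taken from the model names in [1] and [2]. Everything else is an assumption for illustration: 2 bytes per parameter (16-bit weights), and a hypothetical `tiny_params` value that is just 1.1 billion divided by 10^4, not the actual size of the model they ran. It counts weights only and ignores activations and runtime overhead.

```python
import math

# Parameter counts taken from the model names cited in [1] and [2].
MODELS = {
    "TinyLlama 1.1B": 1.1e9,
    "Phi-3 mini 3.8B": 3.8e9,
}


def weight_bytes(params: float, bytes_per_param: float = 2.0) -> float:
    """Approximate memory for the weights alone.

    bytes_per_param=2.0 assumes 16-bit weights; quantized models use less.
    """
    return params * bytes_per_param


def orders_of_magnitude(larger: float, smaller: float) -> float:
    """Base-10 log of the size ratio between two parameter counts."""
    return math.log10(larger / smaller)


if __name__ == "__main__":
    for name, params in MODELS.items():
        gib = weight_bytes(params) / 2**30
        print(f"{name}: ~{gib:.1f} GiB of weights at 2 bytes/param")

    # Hypothetical model 10^4 times smaller than TinyLlama (assumption,
    # derived from the "4 orders of magnitude" claim, not a measured size).
    tiny_params = 1.1e9 / 1e4
    kib = weight_bytes(tiny_params) / 2**10
    gap = orders_of_magnitude(1.1e9, tiny_params)
    print(f"hypothetical tiny model: ~{kib:.0f} KiB of weights, "
          f"{gap:.1f} orders of magnitude smaller")
```

Even with aggressive quantization, the billion-parameter models need on the order of a gigabyte just for weights, which is the gap being pointed at here.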

That doesn’t mean it can’t run it. It just means you can’t imagine that.

But I can imagine it. In fact, I could have told you it would need a significantly smaller model in order to run at an adequate pace on older hardware. It's not a mystery at all, it's a known factor. I think it's absolutely cool that they did it, but let's not pretend it's more than what it is: a modern version of running Doom on non-standard hardware.

[1] https://huggingface.co/TinyLlama/TinyLlama-1.1B-step-50K-105b

[2] https://ollama.com/library/phi3:3.8b-mini-128k-instruct-q5_0

[3] https://www.thirtythreeforty.net/posts/2019/12/my-business-card-runs-linux/

this post was submitted on 30 Dec 2024
200 points (92.0% liked)
