-48
submitted 2 days ago* (last edited 2 days ago) by ChasingEnigma@lemmy.world to c/nostupidquestions@lemmy.world

I'm interested in automatically generating lengthy, coherent stories of 100,000+ words from a single prompt using an open source local large language model (LLM). I came across the "Awesome-Story-Generation" repository which lists relevant papers describing promising methods like "Re3: Generating Longer Stories With Recursive Reprompting and Revision", announced in this Twitter thread from October 2022 and "DOC: Improving Long Story Coherence With Detailed Outline Control", announced in this Twitter thread from December 2022. However, these papers used GPT-3, and I was hoping to find similar techniques implemented with open source tools that I could run locally. If anyone has experience or knows of resources that could help me achieve long, coherent story generation with an open source LLM, I would greatly appreciate any advice or guidance.

top 28 comments
sorted by: hot top controversial new old
[-] swordgeek@lemmy.ca 5 points 15 hours ago

Don't.

Develop real AI, don't rely on bullshit LLMs.

[-] Deestan@lemmy.world 36 points 2 days ago

This concerns me:

stories of 100,000+ words from a single prompt

An LLM excels at making passable derivative work. It does not, by definition, come up with original ideas.

What are you going to do with 100,000+ words of 100% derivative writing where anything potentially original can be summed up in a prompt of a few dozen words?

Will this be published or sold somewhere? Undercutting or crowding out original works?

[-] BlameThePeacock@lemmy.ca 1 points 16 hours ago

You think Humans aren't pumping out 100% derivative works all the time?

Like every shitty romance novel published. There's only so many ways a man can woo a woman, they just change the location, randomize the set of actions from a list of things men can do to turn women on, throw in something to harm the relationship, and then come up with a set of names.

[-] Deestan@lemmy.world 1 points 9 hours ago

You think Humans aren’t pumping out 100% derivative works all the time?

Don't worry. I don't think that.

A big hope I have for AI is that 100% derivative work by humans is now easier to call out. If a rock with a 9V battery could produce it, why should we value it?

[-] hisao@ani.social -4 points 1 day ago

This is a cool way to put it, but I think even just errors and randomness in reproduction of source ideas sometimes can count as original ideas. Nevertheless, I also think it doesn't fully encompass all range of mechanisms by which humans come up with original ideas.

[-] Deestan@lemmy.world 9 points 1 day ago

Randomness can give novel combinations, sure, but we shouldn't call than an original idea.

As for the various ways humans come up with original ideas, they are based on a level of reflection, reasoning and thought processing. We know that's not possible for an LLM: while they are complex in their details, the way they work is very well defined. They imitate.

[-] hisao@ani.social 2 points 1 day ago

I agree with this in terms of process, but not necessarily agree in terms of result. If you enumerate the state space of target domain, you might realize that all the constructions there can be achieved by randomly introducing errors or modifications to finite set of predefined constructions. Most AI models don't really work like this from what I know (they don't try to randomize inference or introduce errors on purpose), otherwise they could probably evade model collapse. But I don't see why they can't work like this. Humans do often work like this though. A lot of new genres and styles appear when people simply do something inspired by something else, but fail to reproduce it accurately, and when evaluating it they realize they like how it turned out and continue doing that thing and it evolves further by slight mutations. I'm not saying I want AI to do this, or that I like AI or anything, I'm just saying I think this is a real possibility.

[-] hendrik@palaver.p3x.de 1 points 1 day ago* (last edited 1 day ago)

I think so, too. I mean we also have human authors end up at a random camping site somewhere in Europe in the 70s and come up with the random idea of writing "The hitchhikers guide to the galaxy". Either we allow randomness to inspire a novel. Or we'd need to say a lot of old novels aren't original ideas either.

[-] november@lemmy.vg 16 points 1 day ago

Why do you want to do this? What is your end goal?

If it's to read a story, there are already more stories in the world than you could hope to read in your entire lifetime. Written by humans, with actual intention behind them, guaranteed to be coherent.

If it's to create a story, well, you're not creating anything by having an LLM do it for you.

[-] simple@lemm.ee 17 points 2 days ago

You need to use an LLM with a very long context length, potentially 1 million+ tokens. I don't know if any local LLMs can even go that far, and if they can, you'll need an outrageous amount of ram and vram.

But honest question... Why? If you're planning on generating fake books or stories, it's not going to happen, you'll create the most generic barely coherent text.

And fair warning, if you're trying to sell AI generated stories you'll quickly be permabanned from any store, so don't even try it.

[-] kent_eh@lemmy.ca 11 points 1 day ago* (last edited 1 day ago)

Thw 100k word part is relatively easy.

The coherent story part is not possible with today's LLMs, even with a much smaller word count.

Hell, lots of human writers fail at making their stories coherent.

[-] xmunk@sh.itjust.works 14 points 2 days ago

LLM generations of that length tend to go off the rails - I think generating it in chunks where you can try and guide the model back onto the rails it probably a more sane technique.

There are several open source llms to lean on - but for long generations you'll need a lot of memory if you're running it locally.

[-] some_guy@lemmy.sdf.org 7 points 1 day ago

This is why buying books on Amazon now requires checking the author’s background to avoid buying AI slop. I never thought I’d see the day, but it became clear to me last summer.

[-] droporain@lemmynsfw.com 10 points 2 days ago

What is your goal with these stories?

[-] Battle_Masker@lemmy.world 9 points 2 days ago

my guy you gonna plagiarize a whole-ass book using an llm for something THAT big

[-] Hello_there@fedia.io 6 points 1 day ago

Go on fiverr and pay someone

[-] rayquetzalcoatl@lemmy.world 3 points 1 day ago

I am confused as to why you're going through all this struggle! You'll get the same results just copy-pasting big chunks of other books that humans have already put time and effort into writing :) best of luck!

[-] FeelzGoodMan420@eviltoast.org 2 points 1 day ago

Rip society.

this post was submitted on 05 Jan 2025
-48 points (13.6% liked)

No Stupid Questions

36278 readers
876 users here now

No such thing. Ask away!

!nostupidquestions is a community dedicated to being helpful and answering each others' questions on various topics.

The rules for posting and commenting, besides the rules defined here for lemmy.world, are as follows:

Rules (interactive)


Rule 1- All posts must be legitimate questions. All post titles must include a question.

All posts must be legitimate questions, and all post titles must include a question. Questions that are joke or trolling questions, memes, song lyrics as title, etc. are not allowed here. See Rule 6 for all exceptions.



Rule 2- Your question subject cannot be illegal or NSFW material.

Your question subject cannot be illegal or NSFW material. You will be warned first, banned second.



Rule 3- Do not seek mental, medical and professional help here.

Do not seek mental, medical and professional help here. Breaking this rule will not get you or your post removed, but it will put you at risk, and possibly in danger.



Rule 4- No self promotion or upvote-farming of any kind.

That's it.



Rule 5- No baiting or sealioning or promoting an agenda.

Questions which, instead of being of an innocuous nature, are specifically intended (based on reports and in the opinion of our crack moderation team) to bait users into ideological wars on charged political topics will be removed and the authors warned - or banned - depending on severity.



Rule 6- Regarding META posts and joke questions.

Provided it is about the community itself, you may post non-question posts using the [META] tag on your post title.

On fridays, you are allowed to post meme and troll questions, on the condition that it's in text format only, and conforms with our other rules. These posts MUST include the [NSQ Friday] tag in their title.

If you post a serious question on friday and are looking only for legitimate answers, then please include the [Serious] tag on your post. Irrelevant replies will then be removed by moderators.



Rule 7- You can't intentionally annoy, mock, or harass other members.

If you intentionally annoy, mock, harass, or discriminate against any individual member, you will be removed.

Likewise, if you are a member, sympathiser or a resemblant of a movement that is known to largely hate, mock, discriminate against, and/or want to take lives of a group of people, and you were provably vocal about your hate, then you will be banned on sight.



Rule 8- All comments should try to stay relevant to their parent content.



Rule 9- Reposts from other platforms are not allowed.

Let everyone have their own content.



Rule 10- Majority of bots aren't allowed to participate here.



Credits

Our breathtaking icon was bestowed upon us by @Cevilia!

The greatest banner of all time: by @TheOneWithTheHair!

founded 2 years ago
MODERATORS