After experimenting quite a lot with AI content generation (both images and text), I’ve been thinking about what the next-level applications are for storytellers and world-builders.

The first is obvious: integrate AI text and image generation tools directly into writing/publishing software (WordPress? Medium? Scrivener? Vellum?). Hm, it does appear there is a WP Stable Diffusion tool you can use directly within blocks. I think NovelAI also lets you generate text or images (and there was some big stupid kerfuffle about them in the SD community, the details of which I largely ignored).

The second (or alternate first) is what Playground AI is doing, where you can use either DALL-E or SD models right from within the same UI. TextSynth does this for open source alternatives to GPT-3.

So as a writer/world-builder, you’d want to be able to effortlessly switch between generation tools, all from within the same comfortable UI that you’re used to or prefer. Then you’d string the generated items together, and be able to edit & reconfigure them in the composition (sequence) you are creating.

But then there are another couple of layers on top of that, which I’m tentatively calling the metaversal and evolutionary layers.

The metaversal or world-building layer would be the web of connections: entity names, attributes & relationships. You could enter whatever training data you want there, and it would also automatically track and cross-reference it, including compares/diffs. I don’t just mean file versions; I’m thinking about “factual” references in-world across diverse documents.
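
To make the cross-referencing idea a bit more concrete, here is a rough sketch of what tracking in-world “facts” across documents could look like. This is only an illustration under my own assumptions: the `Fact` and `WorldGraph` names, the fields, and the example documents are all made up, not taken from any existing tool.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Fact:
    """One in-world claim, tagged with the document it came from (hypothetical schema)."""
    entity: str     # e.g. a character or place name
    attribute: str  # e.g. "age" or "rules"
    value: str
    source: str     # made-up document identifier


class WorldGraph:
    """Stores facts per (entity, attribute) so contradictions are easy to find."""

    def __init__(self):
        self._facts = defaultdict(list)

    def add(self, fact):
        self._facts[(fact.entity, fact.attribute)].append(fact)

    def conflicts(self):
        """Return the (entity, attribute) pairs that have more than one distinct value."""
        return {
            key: facts
            for key, facts in self._facts.items()
            if len({f.value for f in facts}) > 1
        }


# Example data is invented purely for illustration.
world = WorldGraph()
world.add(Fact("Queen Mara", "age", "42", "chapter-01.md"))
world.add(Fact("Queen Mara", "age", "45", "wiki/mara.md"))
world.add(Fact("Queen Mara", "rules", "Vell", "chapter-01.md"))

for (entity, attribute), facts in world.conflicts().items():
    sources = ", ".join(f"{f.value} ({f.source})" for f in facts)
    print(f"{entity} / {attribute}: {sources}")
```

The point is just that the “diff” here is between claims about the world, not between file versions: two documents disagreeing about the same attribute is what gets flagged.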

The evolutionary layer would mean you could walk away from the app, and it would automatically progress or generate on its own, within constraints customizable by the user, based on the content in its training data or on content that has already been auto-generated and approved by the user.

So when you come back, it would present you with various events, suggestions, problems, checkpoints, or other evolutionary decisions. As the user, you could peruse them, choose whether they are right or wrong for the world, and rate them on some scale of how canonical they are. You could save or trash new text and image developments. Spin out, cancel, or merge timelines.
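
Here is a minimal sketch of that review loop, again purely as an illustration. The `Proposal` and `Timeline` classes, the 0 to 5 canon scale, and the acceptance threshold are all assumptions of mine; a real tool would need something much richer (and an actual generator producing the proposals).

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Proposal:
    """A generated development waiting for the user's verdict (hypothetical)."""
    text: str
    canon: Optional[int] = None  # None = not reviewed yet; assumed scale of 0 to 5


@dataclass
class Timeline:
    name: str
    events: List[str] = field(default_factory=list)

    def fork(self, name):
        """Spin out a new timeline that starts with a copy of this one's events."""
        return Timeline(name, list(self.events))

    def merge(self, other):
        """Pull in events from another timeline, skipping ones already present."""
        for event in other.events:
            if event not in self.events:
                self.events.append(event)


def review(timeline, proposal, canon, threshold=3):
    """Record the user's canon rating; keep the proposal only if it meets the threshold."""
    proposal.canon = canon
    if canon >= threshold:
        timeline.events.append(proposal.text)
        return True
    return False  # trashed: rated, but not added to the timeline


# Invented example events, just to show the flow.
main = Timeline("main")
review(main, Proposal("The harbor floods"), canon=4)

alt = main.fork("what-if")
review(alt, Proposal("The queen abdicates"), canon=5)
review(alt, Proposal("Dragons invade"), canon=1)  # rejected as non-canonical

main.merge(alt)
print(main.events)
```

Cancelling a timeline in this sketch would simply mean discarding the forked object without merging it.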

Also included would be character AIs, where at key points you could interact with characters by chat message. You could do it as yourself to train/instruct characters (e.g., stage directions), or you could do it in-character (in the guise of someone else in the world), and this could become the basis for canonical text.

You would be able to take all these artifacts, arrange them into groups and sequences, and export them to various formats to work with other applications (such as outputting files for an ebook or a blog post, or even an audiobook or video down the road).

In looking around for world-building AIs, I found tools that do aspects of this, but none that does all of it together. What I’ve also learned as a user and worker in software is that I don’t really want to trust someone else’s paid subscription service. What happens if they go out of business? What happens if they turn off a key feature?

WordPress is one of the only models that still holds any water for me: I can run my own instance, host it myself, and be in complete control of it. There’s a big developer community & regular updates, but I don’t have to take them if I don’t want to, or I can augment it on my own. Running locally would probably also be a desirable option. I’ve seen people mention Obsidian in a similar context, since it runs locally and is highly extensible (it runs on Markdown, I believe), but I never got into it when I tinkered with it, and it lacks all the evolutionary capacities I mentioned, which are what would take this all to the next level.