ocdevel / gnothi

Gnothi is an open-source AI journal and toolkit for self-discovery. If you're interested in getting involved, we'd love to hear from you.
https://gnothiai.com
GNU Affero General Public License v3.0
166 stars 18 forks source link

Image generation #161

Open lefnire opened 1 year ago

lefnire commented 1 year ago

This would be super cool, but not something we're gonna focus on immediately. Adding here in case someone wants to take a stab.

It would be great to optionally generate an image to go along with journal entries. Stable Diffusion, MidJourney, DALLE-2. If tied to DALLE-2, we'll just piggy-back on the premium OpenAI setup. For Stable Diffusion, we'll follow the same system as #160 - BYO model (IP/ngrok) and we'll host a first-come/first-serve BYO of our own.

The tricky part is the "oddities" around these outputs. I tried running a few of my dreams through DALLE-2, hoping we could easily implement this feature - especially for dream visualization. The output was morbid and grotesque. Very displeasing, especially for someone who just wants to give it a spin (as opposed to knows what they're up against).

So one solution is to first take an entry, pipe it through the LLM (BYO #160 or GPT) in order to craft a safe and focused Stable Diffusion prompt. We'll need a good prompt for creating the prompt, lol - something like "using stable diffusion's prompt syntax, reconstruct the following journal entry to be visualized as an image. Here is a sample prompt for stable diffusion: ".

Anyway, just kicking off the convo here in case others have feedback / ideas. I'm not gonna work on this personally for some time, unless it's a really desired feature.