invoke-ai / InvokeAI

Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, and serves as the foundation for multiple commercial products.
https://invoke-ai.github.io/InvokeAI/
Apache License 2.0
23.75k stars 2.44k forks source link

[enhancement]: Magic Prompts #6719

Open xaroth8088 opened 3 months ago

xaroth8088 commented 3 months ago

Is there an existing issue for this?

Contact Details

No response

What should this feature add?

The ability to use an LLM (e.g. LLama) to take in a short prompt and expound on it to enhance generated image quality.

Alternatives

Running something like GPT4All in parallel, or using ChatGPT/Claude/etc. to generate the enhanced prompts.

Additional Content

ChatGPT/DALL-E 3 does this secretly in the background (though you can get at the actual prompt after the image is generated), and Ideogram has a toggle for this.

Natural follow-on features would include:

stripealipe commented 1 month ago

Yeah, I use similar in some of my Comfy workflows and it's super cool for creative purposes. For Forge and now Invoke I'm typing manually into LM Studio which is of course somewhat slow...

Invoke is great btw - thanks for all the hard work so far!

dennisrall commented 3 days ago

There are also some specialized models for this like [https://huggingface.co/gokaygokay/Flux-Prompt-Enhance]()