YangLing0818 / RPG-DiffusionMaster

[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
https://proceedings.mlr.press/v235/yang24ai.html
MIT License
1.7k stars 99 forks source link

WebUI extension is ready #28

Closed zydxt closed 9 months ago

zydxt commented 9 months ago

Hey folks, I created an extension sd-webui-rpg-diffusionmaster I added Gemini Pro and OpenAI Azure support but the local LLM support is still on the way. Hope this can help you playing with RPG-DiffusionMaster

The extension's code structure was quite different and I'm not sure if I should try to merge it into this repo. Let me know if you have any suggestion

YangLing0818 commented 9 months ago

good try!