YangLing0818 / RPG-DiffusionMaster

[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
https://proceedings.mlr.press/v235/yang24ai.html
MIT License
1.7k stars 99 forks source link

Can you provide more details on how you enhanced stability for extracting regional prompts? #55

Closed andupotorac closed 5 months ago

andupotorac commented 5 months ago

We're planning to use RPG in our own project as well, and if yours is a fix to their code - and it's not merged with their branch - we'd like to understand what the issue was, and how you fixed it, so we can apply the same patch on their code too.

Thanks!

andupotorac commented 5 months ago

Sorry, thought I was posting at https://github.com/zydxt/sd-webui-rpg-diffusionmaster/.