YangLing0818 / RPG-DiffusionMaster

[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
https://proceedings.mlr.press/v235/yang24ai.html
MIT License
1.7k stars 99 forks source link

intermediate results #56

Open WaiBiBaBolmc opened 2 months ago

WaiBiBaBolmc commented 2 months ago

Thank you for your efforts; this is a very creative piece of work. Do you have plans to open-source the intermediate results, such as the prompt recaption outcomes and the corresponding regional partition codes?