lllyasviel / Omost

Your image is almost there!
Apache License 2.0
7.33k stars, 420 forks

Adding related work on LLM-assisted generation and editing #39

Open thss15fyt opened 5 months ago

thss15fyt commented 5 months ago

Hi, thanks for your contribution to this awesome work! We also study this topic, using LLM planning to generate images of complex scenes. Our paper is Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following (CVPR 2024), with open-sourced code. Beyond generation, we have also worked on using it to modify generated images through progressive adjustments or chat-based editing (also discussed in this repo). We hope our exploration can help further develop the editing function in this repository, e.g. maintaining ID consistency and understanding complex editing instructions with an LLM.

Our framework is illustrated as follows:

[Image: framework illustration]

lllyasviel commented 5 months ago

Thanks! https://github.com/lllyasviel/Omost/commit/bdf51c7a3944bb7dce60263023ac8d7a01c42b80