ACM MM'23 (oral). SUR-adapter enables pre-trained diffusion models to acquire the semantic understanding and reasoning capabilities of large language models, building a high-quality textual semantic representation for text-to-image generation.
MIT License
111 stars
2 forks
Is there a chance that we could get an open-source dataset? #3