FoundationVision / VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
MIT License
3.78k stars 285 forks source link

T2I Generation #64

Open yang326922943 opened 1 month ago

yang326922943 commented 1 month ago

When the T2I code can be released, thank you

keyu-tian commented 1 month ago

We have no firm plans. I personally hope it will be ready by July.

kl2004 commented 1 week ago

A new paper similar to VAR but for t2i: https://krennic999.github.io/STAR/