FoundationVision / LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
https://arxiv.org/abs/2406.06525
MIT License
1.33k stars 55 forks source link

tokenizer of 4 dim #40

Open DidiD1 opened 4 months ago

DidiD1 commented 4 months ago

Thanks for your work! Would u release the .pth of more VQ-Model version, especially 4 hidden dim? And I wonder how the tokenizer performs with LDM. Thanks for answering!