FoundationVision / LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
https://arxiv.org/abs/2406.06525
MIT License
1.32k stars 56 forks source link

Why is the model GPT in the code? #64

Open wangyf8848 opened 1 month ago

wangyf8848 commented 1 month ago

Hello, the paper mentions using Llama for autoregressive training, but why is the language model in the code using GPT ?

BaohaoLiao commented 1 month ago

It's only a name. The GPT architecture in this project has the same arch as Llama