Why is the model named GPT in the code?

FoundationVision / LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

https://arxiv.org/abs/2406.06525

MIT License

1.32k stars 56 forks

Open wangyf8848 opened 1 month ago

wangyf8848 commented 1 month ago

Hello, the paper mentions using Llama for autoregressive training, but why does the code use GPT as the language model?

BaohaoLiao commented 1 month ago

It's only a name. The GPT architecture in this project is the same as Llama's.
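To make the "only a name" point concrete, here is a minimal sketch. It is not code from the repository: the class `GPT` and the functions `rms_norm` and `layer_norm` are hypothetical names chosen for illustration. It assumes RMSNorm as a representative Llama-style component and contrasts it with the LayerNorm used in GPT-2, showing that a class called `GPT` can still be built from Llama-style parts.

```python
import math


def rms_norm(x, weight, eps=1e-5):
    """Llama-style RMSNorm: divide by the root mean square, then scale.

    There is no mean subtraction and no bias term.
    """
    rms = math.sqrt(sum(v * v for v in x) / len(x) + eps)
    return [w * v / rms for v, w in zip(x, weight)]


def layer_norm(x, weight, bias, eps=1e-5):
    """GPT-2-style LayerNorm: subtract the mean, divide by the standard
    deviation, then scale and shift."""
    mean = sum(x) / len(x)
    var = sum((v - mean) ** 2 for v in x) / len(x)
    std = math.sqrt(var + eps)
    return [w * (v - mean) / std + b for v, w, b in zip(x, weight, bias)]


class GPT:
    """Hypothetical toy model: the class is named GPT, but it normalizes
    activations the Llama way (RMSNorm)."""

    def __init__(self, dim):
        self.norm_weight = [1.0] * dim  # learnable scale in a real model

    def normalize(self, x):
        return rms_norm(x, self.norm_weight)


if __name__ == "__main__":
    x = [3.0, 4.0]
    print("GPT (RMSNorm):     ", GPT(2).normalize(x))
    print("GPT-2 (LayerNorm): ", layer_norm(x, [1.0, 1.0], [0.0, 0.0]))
```

The class name says nothing about which components sit inside it; what matters is the layer definitions themselves, so checking the model file is the reliable way to confirm which architecture is actually used.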