jeonsworld / ViT-pytorch

Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)
MIT License
1.95k stars 374 forks source link

fixed typos #43

Open topsy404 opened 2 years ago

topsy404 commented 2 years ago

Maybe we should decrease the batch size by the value of --train_batch_size if We do not have enough memory?