yitu-opensource / T2T-ViT

ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
Other
1.15k stars 176 forks source link

ImageNet100 very low accuracy #36

Closed Evgeneus closed 3 years ago

Evgeneus commented 3 years ago

Dear authors,

I would like to run T2T on ImageNet100 on 2 gpus. But I have gotten just 8.5 in top-1 accuracy after 200 epochs! Also the train loss is high. Do you know what can be a reason for that?

yuanli2333 commented 3 years ago

Hi,

We also trained our T2T-ViT on other datasets like CIFAR100 from scratch, and got reasonable results (77%-80%). So I am not sure why your training not work on ImageNet100 without enough information.

You can also borrow some training method from our transfer learning or other implementations like this one, which only train 60 epoches but still achieve accuracy > 70%.