raoyongming / DynamicViT

[NeurIPS 2021] [T-PAMI] DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification
https://dynamicvit.ivg-research.xyz/
MIT License
551 stars 69 forks source link

DynamicVIT training stored checkpoint #18

Closed SwapnilDreams100 closed 2 years ago

SwapnilDreams100 commented 2 years ago

Hi, thanks for the great work! I am training the dvit for the deit-small, and the checkpoint it stored has size of 364 MB, while the original model and the weights you shared are ~90MB. Am I making an error?

raoyongming commented 2 years ago

Hi, thanks for your interest in our work. We remove the parameters of the optimizer and ema models from the checkpoints we shared to reduce the file size. Therefore, it is normal that the checkpoints created by the training script are larger.

SwapnilDreams100 commented 2 years ago

ah makes sense, thanks!