OliverRensu / TinyMIM


The pos_embed's dimension of checkpoint `TinyMIM-FT-Tstar.pth` #6

Closed RockeyCoss closed 1 year ago

RockeyCoss commented 1 year ago

Thanks for your wonderful work! Could you please tell me why the pos_embed parameter in the checkpoint `TinyMIM-FT-Tstar.pth` has length 198 (`14*14+2`) instead of 197 (`14*14+1`)? Thank you very much!

OliverRensu commented 1 year ago

We use DeiT-style knowledge distillation when fine-tuning TinyMIM-Tstar, so there is an extra distillation token.

RockeyCoss commented 1 year ago

Thank you very much! Is the first token the distillation token?

OliverRensu commented 1 year ago

No, the second token is the distillation token. You can refer to https://github.com/OliverRensu/TinyMIM/blob/master/Segmentation/TinyMIMstar-T/backbone/mae.py
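For readers loading this checkpoint elsewhere, the token order described above can be sketched as index arithmetic. This is only an illustration, not code from the repository: the helper name `token_layout` and its `grid_size` parameter are hypothetical, and it assumes a 14x14 patch grid with the class token at index 0, as in DeiT.

```python
def token_layout(num_tokens, grid_size=14):
    """Return slices for the class, distillation, and patch position embeddings.

    Hypothetical helper: assumes the extra tokens come before the patch tokens.
    """
    num_patches = grid_size * grid_size
    num_extra = num_tokens - num_patches
    if num_extra == 1:
        # Plain ViT layout: [cls, patch_0, ..., patch_195]
        return {"cls": slice(0, 1), "dist": None, "patches": slice(1, num_tokens)}
    if num_extra == 2:
        # DeiT-style layout: [cls, dist, patch_0, ..., patch_195]
        return {"cls": slice(0, 1), "dist": slice(1, 2), "patches": slice(2, num_tokens)}
    raise ValueError(f"unexpected number of extra tokens: {num_extra}")


# 198 = 14*14 + 2, so the distillation token sits at index 1.
layout = token_layout(198)
```

With a position embedding tensor of shape `(1, 198, C)`, indexing the token dimension with `layout["patches"]` would keep only the 196 patch embeddings, and `layout["dist"]` would pick out the distillation token.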

RockeyCoss commented 1 year ago

I see, thank you very much!