Closed Dongshengjiang closed 2 years ago
Thanks for your interest in ConvMAE.
Thanks for your repid reply. Is the no mask token MIM pretrain better than contrasive learning for convolution network?
Your question is (pure convolution vs hybrid convolution / transformer vs transformer) or (pure convolution vs masked convolution)?
I mean that does the performance of mae for convolution network( such as resnet50, convnext) is better than traditional contrasive learning methods(such as dino, byol).
Evaluation pure convolution network with different pretraining paradigm such as MAE, DINO, BYOL is beyond the scope of this paper.
have you try pure convolution network? does this work?