Open islam-nassar opened 2 years ago
@islam-nassar
Yes, we tried with a ViT backbone! In short, it worked out-of-the-box with the following setup (similar to DINO):
Evaluation: soft NN 10% labels (no fine-tuning):
*You can probably just use a constant WD (weight decay) value, though; I'm not sure the increasing schedule was that important in this experiment.
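For anyone unsure what an increasing WD schedule looks like in practice, here is a minimal sketch, assuming a half-cosine ramp from a small starting value to a larger final one. The function name `cosine_wd_schedule` and the values 0.04 and 0.4 are illustrative placeholders, not the exact settings from this experiment.

```python
import math


def cosine_wd_schedule(start_wd, final_wd, total_steps):
    """Return one weight-decay value per training step.

    The value moves from start_wd to final_wd along a half cosine,
    so it increases when final_wd > start_wd and stays flat when
    the two are equal (the constant-WD case).
    """
    if total_steps < 1:
        raise ValueError("total_steps must be at least 1")
    schedule = []
    for step in range(total_steps):
        # progress runs from 0.0 at the first step to 1.0 at the last step
        progress = step / max(1, total_steps - 1)
        cosine = 0.5 * (1.0 + math.cos(math.pi * progress))
        schedule.append(final_wd + (start_wd - final_wd) * cosine)
    return schedule


# Placeholder values for illustration only.
increasing = cosine_wd_schedule(start_wd=0.04, final_wd=0.4, total_steps=5)
constant = cosine_wd_schedule(start_wd=0.1, final_wd=0.1, total_steps=5)
```

In a PyTorch training loop, one would typically write the value for the current step into each optimizer parameter group (`group["weight_decay"] = increasing[step]`) before calling `optimizer.step()`. Passing the same value for both endpoints gives the constant-WD variant.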
Let me know if there's some other information about the setup you need that I forgot to mention!
Hi Mido,
Thanks for the excellent work and for sharing it. I was curious whether you have tried PAWS with a ViT (transformer) backbone. I ask because your concurrent work (DINO) and others use ViT, so I was hoping you had done that. If not, do you reckon it would be straightforward to do by adjusting the model in your code, or do you foresee bigger implications?
Cheers