Closed liuweijie19980216 closed 2 years ago
Hello, we use a ViT pretrained with the self-supervised DINO method. This uses ImageNet-1k but without labels. So we do not use any extra labelled data.
Thanks for your reply. Actually, I love the work very much.
I noticed that KNN is trained using the pre-trained VIT model, which means that the work also uses training samples of pre-trained VIT, which seems unfair.