Open abhijay9 opened 3 years ago
Hello,
The reason for this error is that you are loading the imagenet21k
weights instead of the fine-tuned version, so you need to set num_classes
to 21843.
Note that the fine-tuned version of the ViT-H model is not listed as one of the available models, therefore if you need a model that is fine-tuned on the Imagenet 1k, you can use one of the models listed on the front page of this repository.
I was testing using the provided visualize_attention_map.ipynb
The ViT-B_16-224 loads fine but when I downloaded and was loading ViT-H_14, it gave me the following error:
What do you think might be the error?