I loaded the pretrained weights “deit_tiny_patch16_224-a1311bcf.pth”, but there is an error "size mismatch for head.weight: copying a param with shape torch.Size([1000, 192]) from checkpoint, the shape in current model is torch.Size([100, 192])", I have modified the "--num-classes" to 100
Hi, the checkpoint of DeiT is for ImageNet dataset. Teacher models for CIFAR-100 are finetuned from ImageNet models. You can refer to this issue for more details.
I loaded the pretrained weights “deit_tiny_patch16_224-a1311bcf.pth”, but there is an error "size mismatch for head.weight: copying a param with shape torch.Size([1000, 192]) from checkpoint, the shape in current model is torch.Size([100, 192])", I have modified the "--num-classes" to 100