I am using KD to transfer knowledge from the ViT-Base model to ResNet18 on the ImageNet dataset, following the ImageNet/ViT/train.sh script. However, top-1 accuracy has only reached 46.2% at epoch 8 and is increasing slowly, which makes me concerned about the final accuracy of the model.