Nice work! For TinyImageNet you use small but high-resolution networks: for example, the patch size of ViT is 8 and the window size of Swin is 4. On ImageNet, however, a Swin window size of 4 does not work. Which parameters did you use for ImageNet? Could you please share some details about the ViT and Swin configurations?