THanks to your excellent work.
I would like to use Vim as backbone for downstream task, I need to load the pretraied weight on Imagenet-1K, then finetune the network on downstream-task dataset,
then, what lr_scheduler and optimizer should I use?
Could you please give me some suggestions about the super-params?
Dear authors,
THanks to your excellent work. I would like to use Vim as backbone for downstream task, I need to load the pretraied weight on Imagenet-1K, then finetune the network on downstream-task dataset, then, what lr_scheduler and optimizer should I use? Could you please give me some suggestions about the super-params?