Thanks for the useful repo. I was going through the code, and upon inspection I saw that Vim-T and Vim-S configurations have double the number of blocks (depth=24) whereas both Tiny and Small configurations for ViT/DeiT in timm have depth=12. Is there a reason for this disparity?
Thanks for the useful repo. I was going through the code, and upon inspection I saw that Vim-T and Vim-S configurations have double the number of blocks (
depth=24
) whereas both Tiny and Small configurations for ViT/DeiT in timm havedepth=12
. Is there a reason for this disparity?