Closed zhl98 closed 1 year ago
yes, you can see this config of EVA-g.
Thank you very much for your reply. However, I still encountered a problem when using my patch_ The size is 14.
I would like to inquire about deform_inputs1, deform_inputs2 don't think the variables are learnable parameters in the code, or are they calculated from the input. Do they have no impact as long as the dimensions match? What dimensions should they be?
You should resize the patch embedding from 14 to 16. Like these models: https://github.com/czczup/ViT-Adapter/tree/main/detection/configs/mask_rcnn/dinov2
Using this script to resize patch embedding from 14x14 to 16x16: https://github.com/czczup/ViT-Adapter/blob/main/detection/convert_14to16.py
@czczup: Shouldn't we also transform pos_embed
weights to 16 x 16? (currently converting just patch_embed)
Hello
Thank you very much for your work. I would like to know how to add adapter to other sizes of vits, such as vit g?