using the utils.load_state_dict() function, I get multiple errors, including:
_size mismatch for patchembed.proj.weight: copying a param with shape torch.Size([768, 3, 2, 16, 16]) from checkpoint, the shape in current model is torch.Size([768, 3, 16, 16]).
I assume this might be because the tubelet size is missing, which by default is set to 2 (and could be the dimension I am missing). So I guess the main question is, how to load the model (and which model)?
How can one load and use the pre-trained distilled models from the model zoo?
First, creating the model using (needed to comment out all non-default params as they are not recognized):
When I am trying to load the weights: https://pjlab-gvm-data.oss-cn-shanghai.aliyuncs.com/internvideo/distill/vit_s_k710_dl_from_giant.pth
using the utils.load_state_dict() function, I get multiple errors, including: _size mismatch for patchembed.proj.weight: copying a param with shape torch.Size([768, 3, 2, 16, 16]) from checkpoint, the shape in current model is torch.Size([768, 3, 16, 16]).
I assume this might be because the tubelet size is missing, which by default is set to 2 (and could be the dimension I am missing). So I guess the main question is, how to load the model (and which model)?
Any help appreciated, thanks!