Ziyang412 / UCoFiA

PyTorch code for "Unified Coarse-to-Fine Alignment for Video-Text Retrieval" (ICCV 2023)
https://arxiv.org/abs/2309.10091
MIT License

Size mismatch error when testing eval_msrvtt.sh #3

Closed: lexilii closed this issue 7 months ago

lexilii commented 7 months ago

Weights from pretrained model cause errors in UCoFiA:

size mismatch for visual_token_selector.score_predictor.in_conv.0.weight: copying a param with shape torch.Size([512]) from checkpoint, the shape in current model is torch.Size([1024]).
size mismatch for visual_token_selector.score_predictor.in_conv.0.bias: copying a param with shape torch.Size([512]) from checkpoint, the shape in current model is torch.Size([1024]).
size mismatch for visual_token_selector.score_predictor.in_conv.1.weight: copying a param with shape torch.Size([256, 512]) from checkpoint, the shape in current model is torch.Size([512, 1024]).
size mismatch for visual_token_selector.score_predictor.out_conv.0.weight: copying a param with shape torch.Size([256, 512]) from checkpoint, the shape in current model is torch.Size([512, 1024]).
size mismatch for visual_token_selector.score_predictor.out_conv.2.weight: copying a param with shape torch.Size([1, 256]) from checkpoint, the shape in current model is torch.Size([1, 512]).
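In every reported mismatch, the checkpoint dimension is exactly half of the one in the current model (512 vs. 1024, 256 vs. 512). This may indicate that the model was built with a different feature width than the checkpoint was trained with, though the error message alone does not confirm the cause. As a rough diagnostic sketch (not part of the UCoFiA code), the snippet below compares two name-to-shape mappings and lists the parameters that disagree. The helper name `find_shape_mismatches` is hypothetical, and the shapes are copied from the error message above.

```python
from typing import Dict, List, Tuple

Shape = Tuple[int, ...]


def find_shape_mismatches(
    checkpoint_shapes: Dict[str, Shape],
    model_shapes: Dict[str, Shape],
) -> List[Tuple[str, Shape, Shape]]:
    """Return (name, checkpoint_shape, model_shape) for every shared
    parameter whose shape differs between checkpoint and model."""
    mismatches = []
    for name, ckpt_shape in checkpoint_shapes.items():
        model_shape = model_shapes.get(name)
        if model_shape is not None and model_shape != ckpt_shape:
            mismatches.append((name, ckpt_shape, model_shape))
    return mismatches


# With PyTorch, such mappings can be built from a state dict, e.g.:
#   shapes = {k: tuple(v.shape) for k, v in state_dict.items()}
# Here the shapes are hard-coded from the error message for illustration.
prefix = "visual_token_selector.score_predictor."
checkpoint_shapes = {
    prefix + "in_conv.0.weight": (512,),
    prefix + "in_conv.0.bias": (512,),
    prefix + "in_conv.1.weight": (256, 512),
    prefix + "out_conv.0.weight": (256, 512),
    prefix + "out_conv.2.weight": (1, 256),
}
model_shapes = {
    prefix + "in_conv.0.weight": (1024,),
    prefix + "in_conv.0.bias": (1024,),
    prefix + "in_conv.1.weight": (512, 1024),
    prefix + "out_conv.0.weight": (512, 1024),
    prefix + "out_conv.2.weight": (1, 512),
}

for name, ckpt_shape, model_shape in find_shape_mismatches(
    checkpoint_shapes, model_shapes
):
    print(f"{name}: checkpoint {ckpt_shape} vs. model {model_shape}")
```

Running the same comparison on the real checkpoint and on `model.state_dict()` before loading shows at a glance whether only the `visual_token_selector` parameters differ or whether the mismatch is wider.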

Ziyang412 commented 7 months ago

Please refer to this comment: https://github.com/Ziyang412/UCoFiA/issues/2#issuecomment-2002585864. Thanks!