Weights from pretrained model cause errors in UCoFiA:
size mismatch for visual_token_selector.score_predictor.in_conv.0.weight: copying a param with shape torch.Size([512]) from checkpoint, the shape in current model is torch.Size([1024]).
size mismatch for visual_token_selector.score_predictor.in_conv.0.bias: copying a param with shape torch.Size([512]) from checkpoint, the shape in current model is torch.Size([1024]).
size mismatch for visual_token_selector.score_predictor.in_conv.1.weight: copying a param with shape torch.Size([256, 512]) from checkpoint, the shape in current model is torch.Size([512, 1024]).
size mismatch for visual_token_selector.score_predictor.out_conv.0.weight: copying a param with shape torch.Size([256, 512]) from checkpoint, the shape in current model is torch.Size([512, 1024]).
size mismatch for visual_token_selector.score_predictor.out_conv.2.weight: copying a param with shape torch.Size([1, 256]) from checkpoint, the shape in current model is torch.Size([1, 512]).
Weights from pretrained model cause errors in UCoFiA: size mismatch for visual_token_selector.score_predictor.in_conv.0.weight: copying a param with shape torch.Size([512]) from checkpoint, the shape in current model is torch.Size([1024]). size mismatch for visual_token_selector.score_predictor.in_conv.0.bias: copying a param with shape torch.Size([512]) from checkpoint, the shape in current model is torch.Size([1024]). size mismatch for visual_token_selector.score_predictor.in_conv.1.weight: copying a param with shape torch.Size([256, 512]) from checkpoint, the shape in current model is torch.Size([512, 1024]). size mismatch for visual_token_selector.score_predictor.out_conv.0.weight: copying a param with shape torch.Size([256, 512]) from checkpoint, the shape in current model is torch.Size([512, 1024]). size mismatch for visual_token_selector.score_predictor.out_conv.2.weight: copying a param with shape torch.Size([1, 256]) from checkpoint, the shape in current model is torch.Size([1, 512]).