RuntimeError: Error(s) in loading state_dict for Model:
size mismatch for embedding.word_embedding.weight: copying a param with shape torch.Size([32128, 768]) from checkpoint, the shape in current model is torch.Size([32028, 768]).
size mismatch for tgt_embedding.word_embedding.weight: copying a param with shape torch.Size([32128, 768]) from checkpoint, the shape in current model is torch.Size([32028, 768]).
size mismatch for target.output_layer.weight: copying a param with shape torch.Size([32128, 768]) from checkpoint, the shape in current model is torch.Size([32028, 768]).
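This error occurs when using the mengzi-t5-base model from Hugging Face after converting it with the conversion script:
Parameters:
What could be causing this? Thanks.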
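
To help narrow this down, here is a minimal sketch that lists the parameters whose shapes differ between the checkpoint and the model. The helper name `find_size_mismatches` is hypothetical and not part of any library, and the shapes are copied from the error message above. With PyTorch, the two dictionaries could be built from real state dicts with `{k: tuple(v.shape) for k, v in state_dict.items()}`. All three tensors differ only in their first dimension, which is presumably the vocabulary size, so the model appears to be built with 100 fewer vocabulary entries than the checkpoint contains.

```python
def find_size_mismatches(ckpt_shapes, model_shapes):
    """Return (name, checkpoint_shape, model_shape) for each shared
    parameter whose shape differs between checkpoint and model."""
    mismatches = []
    for name, ckpt_shape in ckpt_shapes.items():
        model_shape = model_shapes.get(name)
        if model_shape is not None and tuple(model_shape) != tuple(ckpt_shape):
            mismatches.append((name, tuple(ckpt_shape), tuple(model_shape)))
    return mismatches


# Parameter names and shapes copied from the error message above.
names = [
    "embedding.word_embedding.weight",
    "tgt_embedding.word_embedding.weight",
    "target.output_layer.weight",
]
ckpt_shapes = {name: (32128, 768) for name in names}
model_shapes = {name: (32028, 768) for name in names}

mismatches = find_size_mismatches(ckpt_shapes, model_shapes)
for name, ckpt_shape, model_shape in mismatches:
    row_diff = ckpt_shape[0] - model_shape[0]
    print(f"{name}: checkpoint rows={ckpt_shape[0]}, "
          f"model rows={model_shape[0]}, difference={row_diff}")
```

If the difference is the same for every listed parameter, as it is here, the vocabulary file or vocabulary-size setting passed to the model is a likely place to look, though that is an assumption based only on the shapes in the error.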