Open HarryHuangYZ opened 3 months ago
RuntimeError: Error(s) in loading state_dict for Transformer: size mismatch for tok_embeddings.weight: copying a param with shape torch.Size([16032, 16384]) from checkpoint, the shape in current model is torch.Size([32064, 8192]).
Same here. How did you resolve it?
RuntimeError: Error(s) in loading state_dict for Transformer: size mismatch for tok_embeddings.weight: copying a param with shape torch.Size([16032, 16384]) from checkpoint, the shape in current model is torch.Size([32064, 8192]).
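One thing worth checking in the message above: the two shapes hold the same number of elements (16032 × 16384 = 32064 × 8192 = 262,668,288). That may mean the checkpoint and the model were configured with different sharding or layout settings rather than being different model sizes, though the thread does not confirm this. The sketch below is a minimal, hypothetical helper (`find_shape_mismatches` is not part of any library) that lists mismatched parameters and flags whether the element counts agree. It works on plain shape tuples; with PyTorch, such dicts could be built with something like `{k: tuple(v.shape) for k, v in state_dict.items()}`.

```python
from math import prod


def find_shape_mismatches(checkpoint_shapes, model_shapes):
    """Compare two {param_name: shape} mappings.

    Returns {name: (ckpt_shape, model_shape, same_numel)} for every
    parameter present in both mappings whose shapes differ.
    Hypothetical helper for illustration only.
    """
    mismatches = {}
    for name, ckpt_shape in checkpoint_shapes.items():
        model_shape = model_shapes.get(name)
        if model_shape is None:
            continue  # parameter missing from the model; not a shape mismatch
        ckpt_shape, model_shape = tuple(ckpt_shape), tuple(model_shape)
        if ckpt_shape != model_shape:
            same_numel = prod(ckpt_shape) == prod(model_shape)
            mismatches[name] = (ckpt_shape, model_shape, same_numel)
    return mismatches


# Shapes copied from the error message in this thread.
checkpoint_shapes = {"tok_embeddings.weight": (16032, 16384)}
model_shapes = {"tok_embeddings.weight": (32064, 8192)}

report = find_shape_mismatches(checkpoint_shapes, model_shapes)
for name, (ckpt, model, same_numel) in report.items():
    print(f"{name}: checkpoint {ckpt} vs model {model}, same element count: {same_numel}")
```

If the element counts match for every mismatched parameter, comparing the parallelism and model-size settings used to save the checkpoint against those used to build the model is a reasonable next step.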