OpenMotionLab / MotionGPT

[NeurIPS 2023] MotionGPT: Human Motion as a Foreign Language, a unified motion-language generation model using LLMs
https://motion-gpt.github.io
MIT License
1.46k stars 91 forks source link

Shape mismatch in load.state_dict() #62

Open santurini opened 10 months ago

santurini commented 10 months ago

I am trying to running demo.py on some personalized prompts but I am not able to load the model as I get the following error:

RuntimeError: Error(s) in loading state_dict for MovementConvEncoder:
    size mismatch for main.0.weight: copying a param with shape torch.Size([512, 259, 4]) from checkpoint, the shape in current model is torch.Size([512, 18, 4]).

And the command I am using is:

python demo.py --cfg ./configs/config_h3d_stage3.yaml --example ./demos/t2m.txt
zsolt-csibi commented 8 months ago

I have the same issue, with different dimension:

RuntimeError: Error(s) in loading state_dict for MovementConvEncoder:
        size mismatch for main.0.weight: copying a param with shape torch.Size([512, 259, 4]) from checkpoint, the shape in current model is torch.Size([512, 247, 4]).
duan348733684 commented 7 months ago

I have the same issue, with different dimension:

RuntimeError: Error(s) in loading state_dict for MovementConvEncoder:
        size mismatch for main.0.weight: copying a param with shape torch.Size([512, 259, 4]) from checkpoint, the shape in current model is torch.Size([512, 247, 4]).

@zsolt-csibi have you solved this problem?

duan348733684 commented 7 months ago

I am trying to running demo.py on some personalized prompts but I am not able to load the model as I get the following error:

RuntimeError: Error(s) in loading state_dict for MovementConvEncoder:
  size mismatch for main.0.weight: copying a param with shape torch.Size([512, 259, 4]) from checkpoint, the shape in current model is torch.Size([512, 18, 4]).

And the command I am using is:

python demo.py --cfg ./configs/config_h3d_stage3.yaml --example ./demos/t2m.txt

@santurini have you solved this problem?

zrx-python commented 6 months ago

Have you solved the dimension problem? Can you teach me the method @duan348733684

zrx-python commented 6 months ago

Have you solved the dimension problem? Can you teach me the method @zsolt-csibi

ehsanwwe commented 3 months ago

@zsolt-csibi @duan348733684 @santurini @zrx-python have you solve 🤔?

zrx-python commented 3 months ago

@ehsanwwe 解决了 但是忘记什么原因了,好像是某个超参数 还是 数据集不对的问题,你排查排查把 https://blog.csdn.net/qq_53930200/article/details/137102979?spm=1001.2014.3001.5501

ehsanwwe commented 3 months ago

( learning Chinese Lang is much important than learning Programming language 😄) @zrx-python i do all titles available at the link but everything's are the same