[rank0]: Shape mismatch, 70200 != 64800

THUDM / CogVideo

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Apache License 2.0

9.34k stars 879 forks

[rank0]: Shape mismatch, 70200 != 64800 #475

Open mikeyimer opened 2 weeks ago

mikeyimer commented 2 weeks ago

System Info

Text-to-video with the default configuration; only the model path has been changed.

Information

[X] The official example scripts
[X] My own modified scripts

Reproduction

Text-to-video with the default configuration; only the model path has been changed.

Expected behavior

Text-to-video with the default configuration; only the model path has been changed.
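One possible reading of the two numbers in the error, offered as a guess rather than a confirmed diagnosis: both sizes share a large common factor, and they differ only in the remaining multiplier. The sketch below computes that factor. The helper name `factor_mismatch` is hypothetical, and the 480x720 resolution with 8x spatial downsampling is an assumption about the default configuration, not something stated in this thread.

```python
from math import gcd


def factor_mismatch(a: int, b: int) -> tuple[int, int, int]:
    """Return (common factor, a / common, b / common) for two tensor sizes."""
    common = gcd(a, b)
    return common, a // common, b // common


# The two sizes reported in the error message.
common, left, right = factor_mismatch(70200, 64800)

# Assumption (not from the thread): 480x720 frames with 8x spatial
# downsampling would give a 60x90 latent grid per frame.
assumed_grid = (480 // 8) * (720 // 8)

print(f"common factor: {common}, multipliers: {left} vs {right}")
print(f"assumed per-frame latent grid: {assumed_grid}")
```

If the common factor matches the assumed per-frame grid, the mismatch would come down to a difference of one in the remaining multiplier, which could point to a frame-count disagreement between two parts of the pipeline. That interpretation is unverified here.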

zRzRzRzRzRzRzR commented 2 weeks ago

Can you check the latest main branch? I believe test_inference has not been uploaded, and using the new inference directly in the current main branch should work.

mikeyimer commented 2 weeks ago

It worked after the update, but it ran out of VRAM in the last step.

zRzRzRzRzRzRzR commented 2 weeks ago

Did the out-of-memory error occur at 10 seconds but not at 5 seconds? I ran into the same problem today. The diffusers version we plan to submit next week should resolve it; we have been working on reducing GPU memory usage.