Closed: realisticdreamer114514 closed this issue 2 months ago
The I2V model will generally do exactly that if anything but 49 frames is used.
That was the cause, thanks for pointing it out.
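For anyone who hits the same problem, here is a minimal sketch of a guard that could run before the pipeline is called. It assumes only what the reply above says, namely that the I2V model wants exactly 49 frames. The names `resolve_num_frames` and `I2V_REQUIRED_FRAMES` are hypothetical and are not part of cli_demo.py or diffusers.

```python
import warnings

# Assumption taken from the reply in this thread: I2V only behaves with 49 frames.
I2V_REQUIRED_FRAMES = 49


def resolve_num_frames(requested: int, generate_type: str) -> int:
    """Return the frame count to use (hypothetical helper, not a library API).

    For "i2v" runs, any value other than 49 is replaced with 49 and a warning
    is emitted. Other generate types are passed through unchanged.
    """
    if generate_type == "i2v" and requested != I2V_REQUIRED_FRAMES:
        warnings.warn(
            f"I2V requested with {requested} frames; "
            f"falling back to {I2V_REQUIRED_FRAMES}.",
            stacklevel=2,
        )
        return I2V_REQUIRED_FRAMES
    return requested


if __name__ == "__main__":
    # The setting from this report: 41 frames with the I2V model.
    print(resolve_num_frames(41, "i2v"))
```

The returned value would then be used as the frame count for the generation call. Whether to override silently, warn, or raise an error is a design choice; warning and falling back is just one option.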
System Info
diffusers 0.30.3 with CUDA 12.4 and Python 3.11, in a conda environment on Windows
Information
Reproduction
and reduce the number of frames to 41 (for 5 seconds)
python cli_demo.py --prompt "A female basketball player is standing in a basketball court, her body leaning forward towards the camera. She is wearing a vibrant blue basketball jersey with the number 3 prominently displayed. The person's head is tilted back, and her hands are clasped together in front of her legs. She is reaching for the viewpoint and waving at it closely. The court beneath her is a rich brown color with green wall in the background and a basketball hoop stands against the wall." --model_path "D:\CogVideoX-5b-I2V" --generate_type "i2v" --output_path ./output.mp4 --image_or_video_path "D:\test\process\2.png" --dtype float16
Expected behavior
A normal video is produced.