Open yjhong89 opened 2 weeks ago
Same Error in patch_size: [rank0]: self.spatial_length = latent_width * latent_height // reduce(mul, patch_size[1:]) [rank0]: TypeError: 'int' object is not subscriptable
Another Error is: [rank0]: File "CogVideo/sat/sgm/util.py", line 261, in instantiate_from_config [rank0]: return get_obj_from_str(config["target"])(config.get("params", dict()), extra_kwargs) [rank0]: TypeError: dit_video_concat.Basic3DPositionEmbeddingMixin() got multiple values for keyword argument 'height_interpolation'
Same problem after merging CogVideo1.5X branch. It seems that this update cause the prolems:https://github.com/THUDM/CogVideo/commit/3a9af5bdd937faaa7803914aa94e384d7f40af67#diff-7b0a094155a48dc8761cbfd20bdf64fe0cc9021873b7ab23cea5a5cae38a670e
Same problem after merging CogVideo1.5X branch. It seems that this update cause the prolems:3a9af5b#diff-7b0a094155a48dc8761cbfd20bdf64fe0cc9021873b7ab23cea5a5cae38a670e
Hi, did you solve it? Could you tell me how to modify the code to run? Thanks a lot.
@yjhong89 Hi, have a simple question, how to prepare dataset for image to video (I2V) fine-tuning, thanks.
System Info / 系統信息
When running
sample_video.py
with CogVideoX-5B-I2V model, error raised.since patch_size in config is integer, error occured in here: https://github.com/THUDM/CogVideo/blob/e2987ff565703953b34749db2d1053e26bba2e2c/sat/dit_video_concat.py#L664
after fixing patch_size as [2,2,2] following
load_checkpoint
error occuredInformation / 问题信息
Reproduction / 复现过程
sample_video.py
with CogVideoX-5B-I2V modelExpected behavior / 期待表现