AILab-CVC / VideoCrafter

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
https://ailab-cvc.github.io/videocrafter2/
Other
4.59k stars 343 forks source link

missing keys in checkpoint of base_512_v1 #53

Open MightyChaos opened 11 months ago

MightyChaos commented 11 months ago

Hi authors, thanks so much for releasing the model. I run into issue when trying to load the checkpoint of lower-resolution t2v model, i.e. "base_512_v1". When I load it using the provided text2video script, I run into

RuntimeError: Error(s) in loading state_dict for LatentDiffusion: Missing key(s) in state_dict: "scale_arr", "model.diffusion_model.fps_embedding.0.weight", ...... ".

F.Y.I. I have no problem loading the other two models, i.e. "base_1024_v1" and "i2v_512_v1". Thank you again for helping address this issue!

k-sobolev commented 11 months ago

I experience the same issue with "base_512_v1" checkpoint.