Closed T0L0ve closed 6 months ago
This is likely due to incorrect frame output. Perhaps you could check if the output image shape is normal.
The output image shape is (320,576,3),I didn't change the input image and args in demo.py.
When I use fp16 I get an AttributeError: 'StableVideoDiffusionPipeline' object has no attribute 'dinov2'.
Maybe there's something wrong here?
if needs_upcasting: self.vae.to(dtype=torch.float16) self.dinov2.to(dtype=torch.float16)
dinov2
You can directly remove all the code related to Dino 2. The current version does not utilize Dino 2.
The output image shape is (320,576,3),I didn't change the input image and args in demo.py.
The output appears to be a single image, which is incorrect. The output should be a list of images.
The output image shape is (320,576,3),I didn't change the input image and args in demo.py.
The output appears to be a single image, which is incorrect. The output should be a list of images.
I just print one of the image list. The frames shape after decode_latents is torch.Size([1, 3, 20, 320, 576]).
frames = self.decode_latents(latents, num_frames, decode_chunk_size) print(frames.shape)
I find the image array is nan.
I solved the problem by change the torch version from 1.13 to 2.1.2
I use the fp16 model of svd_xt,but the output img is black image.
controlnet = DragAnythingSDVModel.from_pretrained(args["DragAnything"],local_files_only=True,torch_dtype=torch.float16)
unet = UNetSpatioTemporalConditionControlNetModel.from_pretrained(args["pretrained_model_name_or_path"],subfolder="unet",torch_dtype=torch.float16, variant="fp16",local_files_only=True)
pipeline = StableVideoDiffusionPipeline.from_pretrained(args["pretrained_model_name_or_path"],controlnet=controlnet,unet=unet,local_files_only=True,torch_dtype=torch.float16, variant="fp16")
one of the output is black gif,and I get an error EOFError: no more images in GIF file.