Open alexfredo opened 9 months ago
How did you do that? I can't even generate it right
@YooZF I chose "svd" for the model and set the parameter "Decode t frames at a time" to 1. In general I need to try at least 3 different seeds before getting a correct result. Sometimes nothing happens, and after I change the seed I get a good result, but there's often something weird about the reflections.
Can you provide some log output? My terminal shut down after showing the following. I'm not sure whether running out of memory caused it to shut down automatically.
`PS F:\shared_pc3\generative-models-main> streamlit run f:/shared_pc3/generative-models-main/scripts/demo/video_sampling.py
You can now view your Streamlit app in your browser.
Local URL: http://localhost:8501/
Network URL: http://192.168.168.158:8501/
E:\anaconda\envs\svd\lib\site-packages\streamlit\watcher\local_sources_watcher.py:177: UserWarning: Torchaudio's I/O functions now support par-call bakcend dispatch. Importing backend implementation directly is no longer guaranteed to work. Please use backend keyword with load/save/info function, instead of calling the udnerlying implementation directly.
lambda m: [p for p in m.__path__._path],
VideoTransformerBlock is using checkpointing
VideoTransformerBlock is using checkpointing
VideoTransformerBlock is using checkpointing
VideoTransformerBlock is using checkpointing
VideoTransformerBlock is using checkpointing
VideoTransformerBlock is using checkpointing
VideoTransformerBlock is using checkpointing
VideoTransformerBlock is using checkpointing
VideoTransformerBlock is using checkpointing
VideoTransformerBlock is using checkpointing
VideoTransformerBlock is using checkpointing
VideoTransformerBlock is using checkpointing
VideoTransformerBlock is using checkpointing
VideoTransformerBlock is using checkpointing
VideoTransformerBlock is using checkpointing
VideoTransformerBlock is using checkpointing
Initialized embedder #0: FrozenOpenCLIPImagePredictionEmbedder with 683800065 params. Trainable: False
Initialized embedder #1: ConcatTimestepEmbedderND with 0 params. Trainable: False
Initialized embedder #2: ConcatTimestepEmbedderND with 0 params. Trainable: False
Initialized embedder #3: VideoPredictionEmbedderWithEncoder with 83653863 params. Trainable: False
Initialized embedder #4: ConcatTimestepEmbedderND with 0 params. Trainable: False
Loading model from checkpoints/svd.safetensors
PS F:\shared_pc3\generative-models-main>`
@NicerY Here is my log:
A matching Triton is not available, some optimizations will not be enabled.
Error caught was: No module named 'triton'
VideoTransformerBlock is using checkpointing
VideoTransformerBlock is using checkpointing
VideoTransformerBlock is using checkpointing
VideoTransformerBlock is using checkpointing
VideoTransformerBlock is using checkpointing
VideoTransformerBlock is using checkpointing
VideoTransformerBlock is using checkpointing
VideoTransformerBlock is using checkpointing
VideoTransformerBlock is using checkpointing
VideoTransformerBlock is using checkpointing
VideoTransformerBlock is using checkpointing
VideoTransformerBlock is using checkpointing
VideoTransformerBlock is using checkpointing
VideoTransformerBlock is using checkpointing
VideoTransformerBlock is using checkpointing
VideoTransformerBlock is using checkpointing
Initialized embedder #0: FrozenOpenCLIPImagePredictionEmbedder with 683800065 params. Trainable: False
Initialized embedder #1: ConcatTimestepEmbedderND with 0 params. Trainable: False
Initialized embedder #2: ConcatTimestepEmbedderND with 0 params. Trainable: False
Initialized embedder #3: VideoPredictionEmbedderWithEncoder with 83653863 params. Trainable: False
Initialized embedder #4: ConcatTimestepEmbedderND with 0 params. Trainable: False
Loading model from checkpoints/svd_image_decoder.safetensors
2023-11-23 01:11:17.828 Uncaught app exception
Traceback (most recent call last):
File "C:\generative-models.pt2\Lib\site-packages\streamlit\runtime\scriptrunner\script_runner.py", line 534, in _run_script
exec(code, module.__dict__)
File "C:\generative-models\scripts\demo\video_sampling.py", line 142, in
@NicerY When there's a memory error, it prints "CUDA out of memory", like with Stable Diffusion.
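To make that distinction easier to check, here is a minimal sketch that scans a saved terminal log for the usual signs. It assumes the log was copied into a text string. The function name `classify_log` and the category labels are hypothetical, not part of the repository. A log that simply stops after "Loading model from ..." with no traceback is only consistent with the process being killed (for example by running out of system RAM), it is not proof of it.

```python
def classify_log(log_text: str) -> str:
    """Roughly classify a saved terminal log (hypothetical helper, not from the repo)."""
    lowered = log_text.lower()
    # A GPU memory error is reported explicitly by PyTorch.
    if "cuda out of memory" in lowered:
        return "gpu_oom"
    # Any other Python error leaves a traceback in the log.
    if "traceback (most recent call last)" in lowered:
        return "python_exception"
    # Assumption: no error text after the checkpoint load started suggests
    # the process was terminated from outside (possibly system RAM).
    if "loading model from" in lowered:
        return "silent_exit_during_load"
    return "unknown"


if __name__ == "__main__":
    sample = "Loading model from checkpoints/svd.safetensors\nPS F:\\shared_pc3\\generative-models-main>"
    print(classify_log(sample))
```

If this returns `silent_exit_during_load`, watching system memory usage while the checkpoint loads would be a reasonable next step.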
What's the difference between svd_image_decoder.safetensors and svd_xt.safetensors?
@alexfredo Can I ask about the vertical video? Did you just force a different resolution, and it worked out of the box? There's some info that the model was trained specifically on 1024x576. Is anything else needed? The code suggests increasing the augmentation conditioning, but your result looks quite good, so I wonder whether any other changes were necessary.
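For anyone experimenting with other resolutions, here is a small sketch for sanity-checking an input size before running the model. The 1024x576 figure is the one mentioned above. Snapping to multiples of 64 is an assumption (a common constraint for latent diffusion models), and the names `snap_to_multiple` and `check_resolution` are hypothetical helpers, not functions from the repository.

```python
TRAINED_SIZE = (1024, 576)  # (width, height) as mentioned in this thread


def snap_to_multiple(value: int, multiple: int = 64) -> int:
    """Round value down to the nearest multiple, but never below one multiple."""
    return max(multiple, value - value % multiple)


def check_resolution(width: int, height: int) -> dict:
    """Report how a requested size relates to the trained resolution."""
    snapped = (snap_to_multiple(width), snap_to_multiple(height))
    return {
        "snapped": snapped,
        # Same size the model is said to have been trained on.
        "matches_trained": snapped == TRAINED_SIZE,
        # Trained size rotated by 90 degrees, i.e. a vertical video.
        "is_rotated_trained": snapped == TRAINED_SIZE[::-1],
    }


if __name__ == "__main__":
    print(check_resolution(576, 1024))
    print(check_resolution(1000, 600))
```

A vertical 576x1024 input is reported as the rotated trained size rather than a match, which is the case where raising the augmentation conditioning might be worth trying.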
I ran a few tests with SVD and noticed that reflections seem to stay in the same place too much. I'm not sure, because I haven't generated long enough videos. Has anyone else noticed reflections not reacting correctly? Here are some tests I made:
https://github.com/Stability-AI/generative-models/assets/24534698/3b102ae4-39ee-4e7e-9478-e96bff609e0e
https://github.com/Stability-AI/generative-models/assets/24534698/98bd3a11-5767-493d-8036-9832f564ecf8
https://github.com/Stability-AI/generative-models/assets/24534698/c53dc9cf-9b35-42d3-ade6-0e9a99d8b42d