-
### Model/Pipeline/Scheduler description
Applying pretrained Text-to-Video (T2V) diffusion models to Image-to-Video (I2V) generation tasks using SDEdit often results in low source image fidelity in…
-
Error occurred when executing DownloadAndLoadMimicMotionModel:
Cannot load from D:\ComfyUI_windows_portable\ComfyUI\models\diffusers\stable-video-diffusion-img2vid-xt-1-1 because the following keys…
-
### Checklist
- [X] The issue exists after disabling all extensions
- [X] The issue exists on a clean installation of webui
- [ ] The issue is caused by an extension, but I believe it is caused b…
-
Hi, thanks for this great library!
There seem to be some diffusion models that generate text instead of images. (For example, these two surveys: https://arxiv.org/abs/2303.06574, https://www.seman…
-
First of all, thanks for sharing this work.
I tested this code with the reference code, but the results were not what I expected. As far as similarity is concerned, it is far from InstantID's performance.
Furtherm…
-
Could you provide an example of how to use this method for video inference?
-
Thanks for the great work! Would you mind sharing the system requirements for running inference? Can I run it on a free Google Colab T4 GPU with 15 GB of GPU RAM?
-
# Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers
> Sora unveils the potential of scaling Diffusion Transformer (DiT) for gener…
-
Excellent work!
Amazing LipVoicer!
I have a small question about the sync evaluation metrics, LSE-C and LSE-D.
In [LIPVOICER: GENERATING SPEECH FROM SILENT VIDEOS GUIDED BY LIP READING](https…
-
Hello,
I encountered an issue when using this plugin. The "video_frame" and "video_mask" directories are created under the project_dir as expected. The "video_frame" directory contains disassembled v…