-
Would it be possible to share the dataset of sign language poses that's used during pre-training and fine-tuning? I understand that the raw version is more than 500 GB, so perhaps the quantized versi…
-
### Is there an existing issue for this?
- [X] I have searched the existing issues and checked the recent builds/commits
### What would your feature do?
Stability AI just dropped their new mo…
-
Hello!
I'm looking into adding diffusion-based image generation to a video game for fairly simple stuff like decals. The game will be running under some pretty heavy resource constraints. If I …
-
Hi, I would like to ask about the GPU memory usage reported in [Line 118 of 01672dc](https://github.com/Drexubery/ViewCrafter/blob/01672dcb3efda1f39b7ea1fc5d6da100a4b14c26/README.md?plain=1#L118), which …
-
I watched all your videos and followed along. It took about 5 days 😀. It was a lot of fun, and I appreciate your work!
Now I wonder how to train this model.
I also watched another video of yours “How diffusion…
-
Experiencing severe face distortion with image-to-video. Here is an example:
![977](https://github.com/user-attachments/assets/991e0a16-9d77-4106-bc09-0989f3493a2f)
https://github.com/user-attachm…
-
Hi Team,
I found your work amazing.
I am trying to get inference for this model running on Google Colab.
After installing the requirements and dependencies,
when I run
!python src/inference.py --input_vide…
-
### Model/Pipeline/Scheduler description
The authors propose a novel inference technique based on a pretrained diffusion model for text-conditional video generation. Their approach, called FIFO-Diffu…
-