huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
https://huggingface.co/docs/diffusers
Apache License 2.0
26.3k stars 5.42k forks source link

Support Lightricks LTX-Video #9995

Open hlky opened 6 hours ago

hlky commented 6 hours ago

Lightricks LTX-Video

LTX-Video is the first DiT-based video generation model capable of generating high-quality videos in real-time. It produces 24 FPS videos at a 768x512 resolution faster than they can be watched. Trained on a large-scale dataset of diverse videos, the model generates high-resolution videos with realistic and varied content. We provide a model for both text-to-video as well as image+text-to-video usecases

Model Code

Integration should be fairly straight forward as the original code is based on Diffusers, some changes will be required like rewriting usage of einops.rearrange.