LTX-Video is the first DiT-based video generation model capable of generating high-quality videos in real-time. It produces 24 FPS videos at a 768x512 resolution faster than they can be watched. Trained on a large-scale dataset of diverse videos, the model generates high-resolution videos with realistic and varied content. We provide a model for both text-to-video as well as image+text-to-video usecases
Integration should be fairly straight forward as the original code is based on Diffusers, some changes will be required like rewriting usage of einops.rearrange.
Lightricks LTX-Video
LTX-Video is the first DiT-based video generation model capable of generating high-quality videos in real-time. It produces 24 FPS videos at a 768x512 resolution faster than they can be watched. Trained on a large-scale dataset of diverse videos, the model generates high-resolution videos with realistic and varied content. We provide a model for both text-to-video as well as image+text-to-video usecases
Model Code
Integration should be fairly straight forward as the original code is based on Diffusers, some changes will be required like rewriting usage of
einops.rearrange
.