ModelsLab / diffusers_plus_plus

Diffusers++: State-of-the-art diffusion models for image and audio generation in PyTorch
https://huggingface.co/docs/diffusers
Apache License 2.0
9 stars 2 forks source link

Add Lumina - Transforming Text into Any Modality #9

Open shauray8 opened 1 month ago

shauray8 commented 1 month ago

a series of text-conditioned Diffusion Transformers (DiT) capable of transforming textual descriptions into vivid images, dynamic videos, detailed multi-view 3D images, and synthesized speech.

Code - https://github.com/Alpha-VLLM/Lumina-T2X

shauray8 commented 1 month ago

there's a lot of residual noise reported with the model, needs to be tested before addition