Closed shauray8 closed 4 days ago
What does this PR do?
Adds Hunyuan DiT. This adds a new DiT block that uses no adaptive layer norm and relies instead on some neat integrations in the pipeline itself.
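
For readers unfamiliar with the distinction, here is a minimal pure-Python sketch contrasting adaptive layer norm (as used in the original DiT, where conditioning produces a shift and scale) with a plain, non-adaptive norm. The function names and the additive conditioning in `plain_norm_with_added_cond` are illustrative assumptions, not code from this PR, and may not match how the Hunyuan DiT block actually injects its conditioning.

```python
import math


def layer_norm(x, eps=1e-6):
    """Plain layer norm over one token vector, with no affine parameters."""
    mean = sum(x) / len(x)
    var = sum((v - mean) ** 2 for v in x) / len(x)
    return [(v - mean) / math.sqrt(var + eps) for v in x]


def ada_layer_norm(x, shift, scale):
    """Adaptive layer norm: the conditioning supplies per-channel shift and scale."""
    return [n * (1.0 + s) + b for n, s, b in zip(layer_norm(x), scale, shift)]


def plain_norm_with_added_cond(x, cond):
    """Hypothetical non-adaptive variant: add the conditioning, then normalize."""
    return layer_norm([v + c for v, c in zip(x, cond)])
```

With zero shift and scale, `ada_layer_norm` reduces to `layer_norm`; the non-adaptive variant never modulates the normalized activations at all.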
[ ] The SDXL VAE is too 'compressed', which makes it harder to finetune; maybe transfer to the SD1/2 latent space while training.
The model seems to be a little undertrained; the architecture, on the other hand, looks really good.
Fixes #12