Alpha-VLLM / Lumina-T2X

Lumina-T2X is a unified framework for Text to Any Modality Generation
MIT License
1.82k stars 74 forks source link

what is the funciton of learn_sigma #56

Closed zhangyongshun closed 2 weeks ago

zhangyongshun commented 2 weeks ago

hi, I find that in the NextDiT model, the learn_sigma is set to True as default, which will double the out_channels and then return the half of them at the end of foward. How does this help to training. Is there any document for it?