Hello!
I see that FP16 training is default for this repo.
Had someone found out, how does FP32 or mixed precision training behave comparing to FP16?
Does full precision model sound better?
And another question
Can you suggest some SOTA tts models to compate to VITS?
I found Grad-TTS and DiffGan to be very interesting, started training them.
Hello! I see that FP16 training is default for this repo. Had someone found out, how does FP32 or mixed precision training behave comparing to FP16? Does full precision model sound better?
And another question Can you suggest some SOTA tts models to compate to VITS? I found Grad-TTS and DiffGan to be very interesting, started training them.
Thanks.