Open OnceJune opened 11 months ago
the same question
We are also very excited to see the new version of VITS!
The VITS2 say it can make fully end-to-end TTS training and inference, without the TTS frontend which transfer text into phoneme sequence. It is means that, for Mandarin, we can input Chinese Characters directly, instead of Pinyin, I doute how much samples do we need then, Because there are so much Characters. Far more then the number of Pinyins.
Hi, VITS2 has been released at:https://arxiv.org/pdf/2307.16430.pdf, do you have the plan to release the code?