Open okulovsky opened 1 week ago
StyleTTS may replace TortoiseTTS.
https://github.com/yl4579/StyleTTS2 https://huggingface.co/spaces/styletts2/styletts2
The voice quality is very good, it's less resource-intense and more stable than TortoiseTTS.
It also supports emotions, so voice's samples can be generated from different emotions and then VITS would train on them as if on different voices.
To proceed, integration of StyleTTS into BrainBox is needed
To clean up voices from imperfect sources, this might be used https://huggingface.co/spaces/ResembleAI/resemble-enhance
To train VITS model of a character in another language:
Ideas/links/reviews on everything related to the voice training