Open psycalc opened 3 years ago
v3 now supports multiple languages. A voice trained in English can somewhat also speak another language. Though more monotonally.
That article is super old. The v3 model now uses a slightly custom tweaked VITS/YourTTS model. Tweaks including larger capacity, bigger lang embedding, custom symbol set (a custom spec of ARPAbet with some more phonemes to cover other languages), and I guess a different training script.
Guide plі in which (direction) neuron network (neural framework) you use and why? in wich direction should I look, in order too make voices more realistic, and sound in other languages? Is it possible at all or it is very complex and hard to train network?