gitmylo / audio-webui

A webui for different audio related Neural Networks
MIT License
973 stars, 90 forks

[FEATURE REQUEST] Add VITS2 support #128

Open Subarasheese opened 10 months ago

Subarasheese commented 10 months ago

Please add support for this implementation of VITS2:

https://github.com/p0p4k/vits2_pytorch/

The repository already includes English and Russian weights. Even though the voices have no "emotion", the speech sounds natural, and it hallucinates way, way less than Bark, Tacotron, and similar models I have tried in the past. Plus, it's blazing fast.

And most importantly, it includes training code.