Closed MackNcD closed 10 months ago
The output is of lower quality than 11labs but you can train models to match prosody better. Then you can improve pitch and fidelity by running it through RVC which I go over a bit here https://youtu.be/IcpRfHod1ic?si=DzZmvIbYUWE6EXgQ
Question/issue/concern in title