Open Cabeda opened 7 months ago
@p0n1 do you have time to give your thoughts on this PR?
@p0n1 do you have time to give your thoughts on this PR?
Hi @Cabeda Thank you for the great work. I just had a surgery and am still recovering at hospital. Will review the code whenever I feel better.
No probs! Hope for the best 💪🏼
Have you tried building the Docker image from the docker file using this? I checked out your repository but apparently its missing gcc and the rust compiler. I think another image is needed to install TTS in the docker image
As the title says this PR adds a new provider supporting the Coqui TTS.
The default model, Tacotron2, works very similar to EdgeTTS although it only has a single voice option for now. The power of this provider is the possibility of supporting multiple open TTS models with some very powerful like jenny.
Another interesting feature is voice dubbing with the likes of XTTS V2. There's a bug on sentences longer than 400 tokens for now though. To support voice dubbing I've added a folder with 3 voice samples and defaulted to the male one. Additionally, in this mode multiple languages are supported. As the options are different than the ones on
--language
I've added a new option named--coqui_language
.For this version the provider supports the same audio formats as edgeTTS thanks to pydub.
Note: To run coqui TTS it will always download the AI model to run. This can go from a few MB to more than 1 GB