Open phirsch opened 7 months ago
Ooh thanx so much for sharing this with me! I will look into it and consider integrating it if it's a good fit!
FYI: mkiol/dsnote/issues/122 might be relevant and unfortunately limit the usefulness of this model until huggingface/parler-tts/issues/11 is fixed/implemented.
Feel free to close the issue if you prefer.
Thanks again! I'll leave it open to remember to check out this project from time to time. <3
@FlorianEagox there are also other really cool TTS models you could implement if you ever get the chance to
And there is another new steerable open source model which looks promising (and even seems to support translation internally, but only EN/CN for now):
Honestly, i really loved the new GPTSoVits V2, it also has really fast generations
Just wanted to bring this new TTS library+model to your attention which allows voice characteristics to be steered via a separate prompt:
https://github.com/huggingface/parler-tts (impressive demos on the HF space linked there).
Afterthought: Wondering whether an LLM might be able to derive such prompts from a pure text transcript...