mkiol / dsnote

Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.
Mozilla Public License 2.0
589 stars 20 forks source link

Request to add StyleTTS 2 model #183

Open Asten-Valentinus opened 1 week ago

Asten-Valentinus commented 1 week ago

Looking around, I've discovered the StyleTTS 2 model.

It seems to be of a much higher quality than other Voice Cloning TTS models, so I think it would be nice to support.

From the statistics mentioned in it's page, it seems to be a little more optimised than YourTTS, which is already supported by SpeechNote.

Alike to what SpeechNote does, it seems to work well when generated on a sentence-by-sentence basis. However, the quality degrades on smaller segments of text.

There is a sweet spot to maximize quality.

You can find the source here.

Thanks for the awesome program!

mkiol commented 5 days ago

Thanks for the recommendation. Adding to my TO-DO list.

BTW, Have you tested the Coqui XTTS model? In my opinion it is great, much better than YourTTS. Especially in version 2.0.2.

Asten-Valentinus commented 4 days ago

Thanks for the recommendation. Adding to my TO-DO list.

BTW, Have you tested the Coqui XTTS model? In my opinion it is great, much better than YourTTS. Especially in version 2.0.2.

Yeah, I've tried it. It is certainly much better! I think Style TTS is higher quality than that though, the demos show it as pretty good.