rhasspy / piper

A fast, local neural text to speech system
https://rhasspy.github.io/piper-samples/
MIT License

Different tones on different parts of the text #490

Open porky11 opened 2 weeks ago

porky11 commented 2 weeks ago

It would be nice if it were possible to apply specific tones to single words or whole sentences.

For example:

(also all combinations, like whispering sarcasm)

Is this already possible somehow? I saw that espeak generates emphasis markers anyway, so maybe these could be altered manually in some way?

Alternatively, I could probably train different variants of a voice and switch between them. However, it doesn't seem possible to switch voices without causing pauses, even with "sentence_silence" set to 0. Still, this would probably be the best workaround so far.
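To make the voice-switching workaround concrete, here is a minimal sketch that joins separately synthesized clips back to back, so that no gap is added between them at playback time. It assumes each text segment has already been rendered to WAV data by whichever voice variant fits its tone; `concat_wav` is a hypothetical helper name, not part of piper. It uses only Python's standard `wave` module, and it does not trim any silence that a voice itself produces at the start or end of a clip.

```python
import io
import wave


def concat_wav(clips: list[bytes]) -> bytes:
    """Join WAV clips (given as bytes) back to back into a single WAV file.

    Hypothetical helper: the clips are assumed to come from separate
    synthesis runs, one per tone/voice variant.
    """
    if not clips:
        raise ValueError("no clips to join")

    out = io.BytesIO()
    fmt = None  # (channels, sample width in bytes, sample rate) of the first clip
    with wave.open(out, "wb") as writer:
        for clip in clips:
            with wave.open(io.BytesIO(clip), "rb") as reader:
                this_fmt = (
                    reader.getnchannels(),
                    reader.getsampwidth(),
                    reader.getframerate(),
                )
                if fmt is None:
                    # The first clip decides the output format.
                    fmt = this_fmt
                    writer.setnchannels(fmt[0])
                    writer.setsampwidth(fmt[1])
                    writer.setframerate(fmt[2])
                elif this_fmt != fmt:
                    # Raw frames can only be appended if the formats agree.
                    raise ValueError(f"format mismatch: {this_fmt} != {fmt}")
                writer.writeframes(reader.readframes(reader.getnframes()))
    return out.getvalue()
```

If the voice variants were trained at different sample rates, the clips would need resampling first; this sketch rejects that case instead of handling it.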

It would still be nice if such a feature existed, preferably without needing to train new voices (if possible).

Daburnell112 commented 6 hours ago

I second this. Some sort of markup language for this would be nice, if one exists.
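As a rough illustration of what such a markup could look like, the sketch below splits text with inline tone tags into segments, each carrying the tones active at that point. The tag syntax (`<whisper>...</whisper>`) and the function name `parse_tones` are made up for this example; nothing in this thread says piper understands any such markup. Nested tags stand for combinations such as whispered sarcasm.

```python
import re

# Hypothetical tag syntax: lowercase names in angle brackets, e.g. <whisper>...</whisper>
TAG = re.compile(r"(</?[a-z]+>)")


def parse_tones(text: str) -> list[tuple[tuple[str, ...], str]]:
    """Split tagged text into (active tones, plain text) segments."""
    active: list[str] = []  # stack of currently open tone tags
    segments: list[tuple[tuple[str, ...], str]] = []
    for token in TAG.split(text):
        if not token:
            continue  # empty strings appear between adjacent tags
        if TAG.fullmatch(token):
            name = token.strip("</>")
            if token.startswith("</"):
                # A closing tag must match the most recently opened tone.
                if not active or active[-1] != name:
                    raise ValueError(f"unexpected closing tag {token}")
                active.pop()
            else:
                active.append(name)
        else:
            segments.append((tuple(active), token))
    if active:
        raise ValueError(f"unclosed tag <{active[-1]}>")
    return segments


if __name__ == "__main__":
    for tones, chunk in parse_tones(
        "Hi. <whisper>Come <sarcasm>closer</sarcasm></whisper>."
    ):
        print(tones, repr(chunk))
```

Each segment could then be sent to whichever voice variant matches its tones, which is where the pause problem described above would still need solving.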