Open TomSeldon opened 6 years ago
I have been bumbed that issue for bunch of times and I wanted to leave this comment. I don't know inside of this projecet but I've been searching this improving speech output via some emphasis work, for 6 weeks. I think it does not make sense to using ssml tags to make it better. For me, it was very hard to handling mapping the words and doing it in a programatic way. So I let it go, instead I looked for pitch corrrection, maybe applying auto tune would be better I tought but it seems it was not the case. Lately I'm looking in voiecelab and emphasis and emphassess to find a way apply something to tts output and make sound better.
Use speech synthesis markup (https://developer.amazon.com/public/solutions/alexa/alexa-skills-kit/docs/speech-synthesis-markup-language-ssml-reference) to improve the realism of the speech output.
e.g.
Need to try this against actual voice output instead of just testing the text response.