MycroftAI / mimic3

A fast local neural text to speech engine for Mycroft
GNU Affero General Public License v3.0

[SSML] Wrong settings used for speech synthesis #27

Open roehrt opened 1 year ago

roehrt commented 1 year ago

Describe the bug Mimic3 ignores the settings of the current <prosody> block and applies the settings of the last closed <prosody> block instead.

To Reproduce mimic3 '<prosody rate="200%">This should be spoken fast but is not.</prosody><prosody volume="30%">This should be a bit quieter but is actually spoken faster</prosody>' --ssml | aplay

Expected behavior Mimic3 should speak the first sentence fast and the second one with lowered volume.
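To make the expected behavior concrete, here is a minimal Python sketch of how each piece of text could be paired with the settings of the <prosody> element that encloses it. It is not Mimic3's implementation: the function name `prosody_segments` is hypothetical, and the example wraps the input in a `<speak>` root (an assumption, since the standard XML parser needs a single root element).

```python
import xml.etree.ElementTree as ET


def prosody_segments(ssml):
    """Return (text, settings) pairs, where settings holds the attributes
    of every <prosody> element enclosing that text (hypothetical helper)."""
    root = ET.fromstring(ssml)
    segments = []

    def walk(element, inherited):
        # Copy the inherited settings so siblings never see each other's values.
        settings = dict(inherited)
        if element.tag == "prosody":
            settings.update(element.attrib)
        if element.text and element.text.strip():
            segments.append((element.text.strip(), settings))
        for child in element:
            walk(child, settings)
            # Tail text comes after the child's closing tag, so it uses
            # the settings of the current element, not the child's.
            if child.tail and child.tail.strip():
                segments.append((child.tail.strip(), settings))

    walk(root, {})
    return segments


# Assumption: a <speak> root is added around the two blocks from the report.
ssml = (
    "<speak>"
    '<prosody rate="200%">This should be spoken fast.</prosody>'
    '<prosody volume="30%">This should be a bit quieter.</prosody>'
    "</speak>"
)

for text, settings in prosody_segments(ssml):
    print(settings, text)
```

With this pairing, the first sentence carries only `rate="200%"` and the second only `volume="30%"`, which is the behavior described above.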

Environment
- Device type: desktop
- OS: Ubuntu 22.04

Source of actual behavior https://github.com/MycroftAI/mimic3/blob/be72c185e471e3ef939147679df9e1d00262c513/mimic3_tts/tts.py#L470-L501

stephenrt42 commented 1 year ago

Try this: `

This should be spoken fast but is not.This should be a bit quieter but is actually spoken faster

`