k2-fsa / sherpa-onnx

Speech-to-text, text-to-speech, speaker recognition, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter, Object Pascal, Lazarus, Rust
https://k2-fsa.github.io/sherpa/onnx/index.html
Apache License 2.0
3.11k stars 359 forks source link

tts: Increase TTS volume to 0 DB. #851

Open domasofan opened 4 months ago

domasofan commented 4 months ago

Hi all,

This seems to affect tts in general. Tested on Android 14 and Windows 11. the generated speech seems to be very quiet and it seems to be under 0 db. According to my normalizer it was at -8.31 DB using the de_DE-thorsten-high model.

I don't know if it is the model or if it is the tts engine. If it is the tts engine would it be possible to raise the volume to a louder volume so it can be understood well?

Greetings and thanks, Simon

malnvenshorn commented 3 months ago

I agree, the voice is very quiet. A volume slider next to the speed slider would be nice, e.g. to adjust the volume from 0% to 200%.

domasofan commented 3 months ago

And also a command line argument for the cli users. Prevering the cli here because of much more versatility.