k2-fsa / sherpa-onnx

Speech-to-text, text-to-speech, speaker diarization, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter, Object Pascal, Lazarus, Rust
https://k2-fsa.github.io/sherpa/onnx/index.html
Apache License 2.0
3.61k stars 422 forks source link

Breaking Voice With Numbers, Giving A Gap #1514

Open studionexus-lk opened 1 week ago

studionexus-lk commented 1 week ago

Google's service, offered free of charge, instantly translates words, phrases, and web pages between English and over 100 other languages.

in above text why is that audio output generated from tts braking with the numbers

But in bellow, it play smoothly

Google's service, offered free of charge, instantly translates words, phrases, and web pages between English and over one hundred other languages.

studionexus-lk commented 1 week ago

Preview Audio Files

non-braking.zip

breaking.mp3


Google's service, offered free of charge, instantly translates words, phrases, and web pages between English and over 100 other languages. 

non-braking.mp3

Google's service, offered free of charge, instantly translates words, phrases, and web pages between English and over one hundred other languages.
csukuangfj commented 1 week ago

could you describe which model you are using and you use it?

The info you provided is toooo limited and we cannot help you.