myshell-ai / MeloTTS

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
MIT License
3.97k stars 473 forks source link

Can't Pronounce Numbers greater or equal to 10,000. #139

Closed ivanhe123 closed 2 weeks ago

ivanhe123 commented 1 month ago

I used whisper to recognize a audio with a person saying: By using the number 100,000 (pronounced as one hundred thousand in audio) as substitution, we got a number of 10,000 (pronounced ten thousand in the audio).

Then I plug the recognized text to melo tts. It says: By using the number 100,000 (pronounced as one hundred, zero, zero,zero) as substituion, we got a number of 10,000 (pronounced ten, zero,zero,zero).

What it should say is one hundred thousand for 100,000 and ten thousand for 10,000. But instead it pronounced it as if it was like 100, 000 as a number and another number seperated by a comma.

How do I fix this?

RedBluePrinter commented 2 weeks ago

Take a look at: https://github.com/myshell-ai/MeloTTSmelo/text/english_utils/number_norm.py

ivanhe123 commented 2 weeks ago

THX VERY MUCH!