Closed eginhard closed 1 year ago
@eginhard thanks for addressing this! We are aware of this. for our initial German version we did not tackle this feature. However, if you want this for your tts model to adjust pauses based on space, be aware that when pronouncing neunhundertvierzigtausendsiebenhundertzweiundzwanzig , there are short pauses between neunhundert, vierzigtausend,
This issue is stale because it has been open for 30 days with no activity. Remove stale label or comment or this will be closed in 7 days.
This issue was closed because it has been inactive for 7 days since being marked as stale.
Describe the bug
In German, numbers are currently normalized with spaces between each digit and unit, although these should normally be written without spaces. In TTS systems, this leads to unnatural pauses in the output.
Steps/Code to reproduce bug
Expected behavior
Output should be "achtzehn millionen neunhundertvierzigtausendsiebenhundertzweiundzwanzig" (spaces are introduced for millions and above).
Environment details
NVIDIA NeMo Text Processing 0.1.7rc0