NVIDIA / NeMo-text-processing

NeMo text processing for ASR and TTS
https://docs.nvidia.com/deeplearning/nemo/user-guide/docs/en/stable/nlp/text_normalization/wfst/wfst_text_normalization.html
Apache License 2.0
242 stars 76 forks source link

EN TN fixes for Issue #166 #185

Open zoobereq opened 2 weeks ago

zoobereq commented 2 weeks ago

What does this PR do ?

Fixes the issue where the sentence-final period in sentences ending with domain is incorrectly normalized as part of the domain. The PR also adds a handful of relevant tests and expands on the top-level domain mappings.

To note: A handful of Sparrowhawk tests failed for the current implementation of EN TN. This fix does not address those failures.

Before your PR is "Ready for review"

Pre checks:

PR Type:

If you haven't finished some of the above items you can still open "Draft" PR.