NVIDIA / NeMo-text-processing

NeMo text processing for ASR and TTS
https://docs.nvidia.com/deeplearning/nemo/user-guide/docs/en/stable/nlp/text_normalization/wfst/wfst_text_normalization.html
Apache License 2.0
242 stars 76 forks source link

Armenian TN #137

Closed davidks13 closed 3 months ago

davidks13 commented 5 months ago

What does this PR do ?

Add a one line overview of what this PR aims to accomplish.

Before your PR is "Ready for review"

Pre checks:

PR Type:

If you haven't finished some of the above items you can still open "Draft" PR.

davidks13 commented 4 months ago

Hi @ekmb . What should I do to pass this check ''continuous-integration/jenkins/pr-head''? I could not open details of that check. Thank you!

tbartley94 commented 4 months ago

Hi @ekmb . What should I do to pass this check ''continuous-integration/jenkins/pr-head''? I could not open details of that check. Thank you!

That's CI on our end. Can't really access if you're not a maintainer. We have to check and provide feedback. Sorry, can be a bit annoying.

tbartley94 commented 4 months ago

@Ara-Yeroyan are there additional issues with this or can I close out reviewing?

Ara-Yeroyan commented 4 months ago

Everything is okay now! The pynini behaviour was different on Windows (docker) and in linux. We have checked with @davidks13.

davidks13 commented 4 months ago

Hi @tbartley94 . Are there any additional issues with the code I need to check?

Ara-Yeroyan commented 4 months ago

Actually there are issues (no handling) with Roman Numbers and the range like numbers - e.g. 26-27

davidks13 commented 4 months ago

Actually there are issues (no handling) with

This is a base for Armenian TN. Those features can be added in the future.

tbartley94 commented 4 months ago

@davidks13 you're good on my technical review. There's a CI issue that requires me to test on local, so the delay is me doing some san testing. I'll be merging later in the week.

@Ara-Yeroyan Roman and ranges are more complex features that are implemented after base TN. Those can be disregarded.

tbartley94 commented 3 months ago

jenkins