NVIDIA / NeMo-text-processing

NeMo text processing for ASR and TTS
https://docs.nvidia.com/deeplearning/nemo/user-guide/docs/en/stable/nlp/text_normalization/wfst/wfst_text_normalization.html
Apache License 2.0

French normalizer crashes #163

Closed: lbehringer closed this issue 2 months ago

lbehringer commented 2 months ago

Describe the bug

Instantiating a Normalizer object with lang set to "fr" crashes after 2-3 minutes: the process is killed before the object is created.

Steps/Code to reproduce bug


>>> from nemo_text_processing.text_normalization import Normalizer
>>> norm = Normalizer(input_case="cased", lang="fr")
 NeMo-text-processing :: INFO     :: Creating ClassifyFst grammars. This might take some time...
Killed

Expected behavior

A Normalizer object for normalizing French text would be instantiated.

Additional context

It works fine for other languages (English, Italian, German, Spanish).
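A bare "Killed" in the shell usually means the process received SIGKILL. On Linux this often comes from the out-of-memory killer, but that is an assumption here, not something confirmed in this report. A minimal sketch for checking how the interpreter actually ended, by running the reproduction in a child process (the helper name `run_snippet` is hypothetical, and the signal reporting assumes a POSIX system):

```python
import signal
import subprocess
import sys


def run_snippet(snippet: str, timeout: float = 900.0) -> str:
    """Run `snippet` in a fresh interpreter and describe how it ended."""
    proc = subprocess.run(
        [sys.executable, "-c", snippet],
        capture_output=True,
        text=True,
        timeout=timeout,
    )
    # On POSIX, a negative return code -N means the child was ended by signal N.
    if proc.returncode < 0:
        return f"terminated by {signal.Signals(-proc.returncode).name}"
    return f"exited with code {proc.returncode}"


# Usage with the reproduction from this report (may take several minutes):
# snippet = (
#     "from nemo_text_processing.text_normalization import Normalizer\n"
#     "Normalizer(input_case='cased', lang='fr')\n"
# )
# print(run_snippet(snippet))
```

If this reports SIGKILL, the kernel log (for example `dmesg`) should show whether the out-of-memory killer was responsible.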

ekmb commented 2 months ago

@mgrafu could you take a look please?

mgrafu commented 2 months ago

We tried to reproduce this issue but could not; the Normalizer object is instantiated correctly on our end. You could try using pynini==2.1.5.
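Before reinstalling, it may help to check which pynini version is actually installed in the failing environment. A small sketch using only the standard library (the helper names `installed_version` and `describe` are hypothetical; the version string is the one suggested above):

```python
from importlib import metadata

SUGGESTED_PYNINI = "2.1.5"  # version suggested in this thread


def installed_version(package: str):
    """Return the installed version of `package`, or None if it is absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


def describe(package: str, suggested: str) -> str:
    """Summarize how the installed version compares with the suggested one."""
    found = installed_version(package)
    if found is None:
        return f"{package} is not installed"
    if found == suggested:
        return f"{package}=={found} matches the suggested version"
    return f"{package}=={found} differs from the suggested {suggested}"


print(describe("pynini", SUGGESTED_PYNINI))
```

If the versions differ, `pip install pynini==2.1.5` in the same environment installs the suggested one, provided a compatible build is available for the platform.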