NVIDIA / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
https://docs.nvidia.com/nemo-framework/user-guide/latest/overview.html
Apache License 2.0
11.66k stars, 2.44k forks

Punctuation and Capitalization Model: how to add custom punctuation marks to the data preparation script? #10677

Open ican24 opened 20 hours ago

ican24 commented 20 hours ago

Dear Team,

Please help me add custom punctuation marks to the following script.

python examples/nlp/token_classification/data/prepare_data_for_punctuation_capitalization.py \
    -s <PATH/TO/THE/SOURCE/FILE> \
    -o <PATH/TO/THE/OUTPUT/DIRECTORY> -p ? --marks ?

I have not been able to add them. Thank you in advance!
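
For context, here is a rough Python sketch of the kind of labeling involved when the set of punctuation marks is configurable. It assumes the two-character label format described in the NeMo punctuation and capitalization documentation, where the first character is the mark that follows the word (`O` for none) and the second is `U` for a capitalized word or `O` otherwise. The function name `make_labels` and its `marks` parameter are hypothetical and are not taken from the script above; this is an illustration, not the script's actual implementation.

```python
def make_labels(text, marks=",.?"):
    """Split text into lowercased words and two-character labels.

    `marks` is a hypothetical parameter: the set of punctuation
    characters that should be turned into labels.
    """
    words, labels = [], []
    for token in text.split():
        # Separate the word from any trailing characters found in `marks`.
        word = token.rstrip(marks)
        trailing = token[len(word):]
        if not word:
            # Token consisted only of punctuation; skip it.
            continue
        # First label character: the mark right after the word, or "O".
        punct = trailing[0] if trailing else "O"
        # Second label character: "U" if the word starts with an uppercase letter.
        cap = "U" if word[0].isupper() else "O"
        words.append(word.lower())
        labels.append(punct + cap)
    return words, labels


if __name__ == "__main__":
    # Adding "!" to the set of marks makes it a label of its own.
    words, labels = make_labels("Hello, world! How are you?", marks=",.?!")
    print(" ".join(words))
    print(" ".join(labels))
```

In this sketch, a mark that is not listed in `marks` stays attached to its word and produces no punctuation label, which is why the set of marks has to be extended before custom punctuation can be learned.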