Hi, thanks for sharing the tool. And I have a question.
I used this command, but it didn't achieve the effect I wanted. So, I want to know how to use the file normalize-punctuation.perl.
NORM_PUNC=/mosesdecoder/scripts/tokenizer/normalize-punctuation.perlperl ${NORM_PUNC} -l en < ${data_dir}/en.txt > ${data_dir}/norm_en.txt
The sentences in the two documents are the same.
However,the imaging process will in-evitably be affected。
The Chinese punctuation marks didn't change to English punctuation marks.
Hi, thanks for sharing the tool. And I have a question. I used this command, but it didn't achieve the effect I wanted. So, I want to know how to use the file normalize-punctuation.perl.
NORM_PUNC=/mosesdecoder/scripts/tokenizer/normalize-punctuation.perl
perl ${NORM_PUNC} -l en < ${data_dir}/en.txt > ${data_dir}/norm_en.txt
The sentences in the two documents are the same.
However,the imaging process will in-evitably be affected。
The Chinese punctuation marks didn't change to English punctuation marks.