Closed Doreenruirui closed 7 years ago
The paper mentioned that each sentence will be padded by and <\s>. Should we process the data and save the file with each sentence in a line?
Yes. Here's a sentence splitter you can use: https://github.com/moses-smt/mosesdecoder/blob/master/scripts/ems/support/split-sentences.perl
The paper mentioned that each sentence will be padded by
and <\s>. Should we process the data and save the file with each sentence in a line?