Open lemo2012 opened 6 years ago
Should I covert the "train","dev", "vocab", and "test" dataset to bpe format first if I want to set "--subword_option=bpe"?
@lemo2012 Yes, you should convert all of them to bpe format.
See this script for how to convert data to the format and generate the vocab file.
Should I covert the "train","dev", "vocab", and "test" dataset to bpe format first if I want to set "--subword_option=bpe"?