Closed Hannibal046 closed 1 year ago
Hi, tokenization seems to be ignored here. After unzip, there are only wiki.train.raw file https://github.com/princeton-nlp/TRIME/blob/2dfbdbd8fad0fe2fd54cf1232b8cdec1bf700ed7/get_data.sh#L12-L21
wiki.train.raw
sorry for bothering, I mistakenly download raw version
Hi, tokenization seems to be ignored here. After unzip, there are only
wiki.train.raw
file https://github.com/princeton-nlp/TRIME/blob/2dfbdbd8fad0fe2fd54cf1232b8cdec1bf700ed7/get_data.sh#L12-L21