gmftbyGMFTBY / Copyisallyouneed

[ICLR 2023] Codebase for Copy-Generator model, including an implementation of kNN-LM
https://openreview.net/forum?id=CROlOA9Nd8C&referrer=%5Bthe%20profile%20of%20Tian%20Lan%5D(%2Fprofile%3Fid%3D~Tian_Lan7)
MIT License
182 stars, 22 forks

question about preprocess data #7

Open bnb-xiaoli opened 1 year ago

bnb-xiaoli commented 1 year ago

Hello, I am trying to preprocess a private Chinese dataset. When I process the data following data/readme.md, the following error occurs:

[Errno 2] No such file or directory: 'dpr_chunk_0_0.pt'

bnb-xiaoli commented 1 year ago

Could you provide instructions on how to preprocess Chinese data?