Closed EachSheep closed 3 months ago
Hi! Thanks for pointing it out. Due to the large file size, it is not efficient to upload it to the cloud, but I have put the processing script in https://github.com/OpenMatch/Augmentation-Adapted-Retriever/blob/main/tools/process_kilt_wikipedia.py
Hello, it seems that the corpus data
kilt_wikipedia
is missing in the data you provided at the link preprocessed data. However, you are referencing this data inpost_pipeline.sh
and I didn't see the method for creating this data in the document. Could you please provide this missing data?