Closed wuzhi19931128 closed 6 days ago
Hi @wuzhi19931128, thanks for your interest. I am uploading it to Hugging Face; it should be ready today.
@wuzhi19931128
The file is above Hugging Face's 300 GB limit, so I am currently hosting it here: https://storage.googleapis.com/tevatron-vision/wiki-ss-hf-data.tar
Please download it from this link with wget,
and then load the data with datasets.load_from_disk.
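For anyone who prefers to do the whole thing from Python, here is a rough sketch of the download, extract, and load steps. It uses urllib in place of wget, and it assumes the tar unpacks into a directory that datasets.load_from_disk can read directly. I have not checked the archive layout, so the helper names (download, extract_tar, load_wiki_ss, main) and the local paths are only illustrative.

```python
import tarfile
import urllib.request
from pathlib import Path

# URL taken from the comment above.
URL = "https://storage.googleapis.com/tevatron-vision/wiki-ss-hf-data.tar"


def download(url, dest):
    """Download url to dest, skipping the transfer if dest already exists."""
    dest = Path(dest)
    if not dest.exists():
        # Plain HTTP download; wget (as suggested above) works just as well
        # and can resume an interrupted transfer with `wget -c`.
        urllib.request.urlretrieve(url, str(dest))
    return dest


def extract_tar(tar_path, out_dir):
    """Extract tar_path into out_dir and return the top-level entries."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    with tarfile.open(str(tar_path)) as tar:
        tar.extractall(str(out_dir))
    return sorted(out_dir.iterdir())


def load_wiki_ss(dataset_dir):
    """Load a dataset directory that was written with save_to_disk."""
    # Imported lazily so the helpers above work without `datasets` installed.
    from datasets import load_from_disk
    return load_from_disk(str(dataset_dir))


def main():
    # Hypothetical local paths; the archive is very large (over 300 GB),
    # so make sure there is room for both the tar and the extracted copy.
    tar_path = download(URL, "wiki-ss-hf-data.tar")
    entries = extract_tar(tar_path, "wiki-ss-hf-data")
    print("extracted:", [p.name for p in entries])
    # Assumption: the first extracted entry is the dataset directory.
    # Inspect the printed list and adjust if the layout differs.
    dataset = load_wiki_ss(entries[0])
    print(dataset)
```

Call main() to run all three steps. If the printed list shows a different layout, pass the right directory to load_wiki_ss yourself.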
grateful!
How can I obtain the Wiki-ss dataset mentioned in the paper https://arxiv.org/pdf/2406.11251?