Open JonathanJao opened 2 years ago
Hi @JonathanJao
Thanks for raising this issue!
For these tasks, please try adding ignore_verifications=True to the load_dataset function, e.g., dataset = load_dataset("kilt_tasks", "wow", ignore_verifications=True). This will skip the checksum verification phase during dataset loading.
We suspect that some dataset owners have updated their files, which makes the checksums recorded in huggingface datasets outdated. (See #2) Unfortunately, we don't have control over this. You may get data samples that differ slightly from those used in our original paper, but we expect the impact of this to be small.
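For anyone on a newer release of the datasets library, here is a minimal sketch of the same workaround. It assumes that ignore_verifications was deprecated in favor of verification_mode="no_checks" starting with datasets 2.9.1 (worth checking against the docs for your installed version). The helper names parse_version, verification_kwargs, and load_without_checksums are hypothetical and are not part of this repo or of the datasets library.

```python
def parse_version(version: str) -> tuple:
    """Turn '2.9.1' or '2.9.1.dev0' into a comparable (major, minor, patch) tuple."""
    parts = []
    for piece in version.split(".")[:3]:
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        parts.append(int(digits) if digits else 0)
    while len(parts) < 3:
        parts.append(0)
    return tuple(parts)


def verification_kwargs(datasets_version: str) -> dict:
    """Pick the keyword argument that skips checksum verification.

    Assumption: the 2.9.1 cutoff reflects when `verification_mode`
    replaced `ignore_verifications` in the datasets library.
    """
    if parse_version(datasets_version) >= (2, 9, 1):
        return {"verification_mode": "no_checks"}
    return {"ignore_verifications": True}


def load_without_checksums(path: str, name: str):
    """Load a dataset config while skipping checksum verification."""
    import datasets  # imported lazily so the helpers above work without it

    kwargs = verification_kwargs(datasets.__version__)
    return datasets.load_dataset(path, name, **kwargs)
```

With this in place, load_without_checksums("kilt_tasks", "wow") should behave like the call suggested above on older versions, and pass verification_mode="no_checks" on newer ones.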
Let me know if some tasks are still not working / missing.
-Qinyuan
Hi, so after resolving the encoding issue, I'm still getting a few errors with the most recent code on the following datasets:
When I try to run a few of them, they output the following:
Running a curl on the URL yields:
The README says that Google Drive has a daily download quota, but this error message suggests that something else may be going on.