google-research / true

Code and data accompanying the paper "TRUE: Re-evaluating Factual Consistency Evaluation".
Apache License 2.0
71 stars 10 forks source link

Checksum error (NLI_Fever) #2

Closed davidswelt closed 1 year ago

davidswelt commented 2 years ago

A freshly downloaded nli_fever.zip (dates from 2019) yielded this checksum error when downloading the datasets:

Downloading and preparing dataset fever/v1.0 [...]

...
  File "/usr/local/google/home/reitter/true/src/download_datasets.py", line 292, in get_fever_labels
    fever_data = load_dataset('fever', 'v1.0', split=data_split)
  File "/usr/local/google/home/reitter/.local/lib/python3.9/site-packages/datasets/load.py", line 1702, in load_dataset
    builder_instance.download_and_prepare(
  File "/usr/local/google/home/reitter/.local/lib/python3.9/site-packages/datasets/builder.py", line 594, in download_and_prepare
    self._download_and_prepare(
  File "/usr/local/google/home/reitter/.local/lib/python3.9/site-packages/datasets/builder.py", line 665, in _download_and_prepare
    verify_checksums(
  File "/usr/local/google/home/reitter/.local/lib/python3.9/site-packages/datasets/utils/info_utils.py", line 40, in verify_checksums
    raise NonMatchingChecksumError(error_msg + str(bad_urls))

datasets.utils.info_utils.NonMatchingChecksumError: Checksums didn't match for dataset source files:
['https://s3-eu-west-1.amazonaws.com/fever.public/train.jsonl', 'https://s3-eu-west-1.amazonaws.com/fever.public/shared_task_dev.jsonl', 'https://s3-eu-west-1.amazonaws.com/fever.public/shared_task_dev_public.jsonl', 'https://s3-eu-west-1.amazonaws.com/fever.public/shared_task_test.jsonl', 'https://s3-eu-west-1.amazonaws.com/fever.public/paper_dev.jsonl', 'https://s3-eu-west-1.amazonaws.com/fever.public/paper_test.jsonl']
roeeaharoni commented 2 years ago

Is this still happening? I tried to run it in a fresh folder and it didn't reproduce.