ExpressAI / DataLab

The unified platform for data-related resources.
https://expressai.github.io/DataLab/
Apache License 2.0
131 stars 27 forks source link

coqa dataset is broken #399

Open neubig opened 1 year ago

neubig commented 1 year ago
>>> datalabs.load_dataset("coqa", "open_domain_question_answering")
Couldn't find a directory or a dataset named 'coqa' in this version. It was picked from the master branch on github instead.
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/Users/gneubig/opt/anaconda3/envs/explainaboard_web/lib/python3.10/site-packages/datalabs/load.py", line 2144, in load_dataset
    builder_instance.download_and_prepare(
  File "/Users/gneubig/opt/anaconda3/envs/explainaboard_web/lib/python3.10/site-packages/datalabs/builder.py", line 747, in download_and_prepare
    self._download_and_prepare(
  File "/Users/gneubig/opt/anaconda3/envs/explainaboard_web/lib/python3.10/site-packages/datalabs/builder.py", line 844, in _download_and_prepare
    split_generators = self._split_generators(dl_manager, **split_generators_kwargs)
  File "/Users/gneubig/.cache/expressai/modules/datasets_modules/datalab/coqa/6306f4340c9292e4899aa1b0f46a4bb13bf06ff186a4a8c7b11ddd9bc219595e/coqa.py", line 115, in _split_generators
    test_path = dl_manager.download_and_extract(_TEST_DOWNLOAD_URL)
  File "/Users/gneubig/opt/anaconda3/envs/explainaboard_web/lib/python3.10/site-packages/datalabs/utils/download_manager.py", line 322, in download_and_extract
    return self.extract(self.download(url_or_urls))
  File "/Users/gneubig/opt/anaconda3/envs/explainaboard_web/lib/python3.10/site-packages/datalabs/utils/download_manager.py", line 221, in download
    downloaded_path_or_paths = map_nested(
  File "/Users/gneubig/opt/anaconda3/envs/explainaboard_web/lib/python3.10/site-packages/datalabs/utils/py_utils.py", line 297, in map_nested
    return function(data_struct)
  File "/Users/gneubig/opt/anaconda3/envs/explainaboard_web/lib/python3.10/site-packages/datalabs/utils/download_manager.py", line 248, in _download
    return cached_path(url_or_filename, download_config=download_config)
  File "/Users/gneubig/opt/anaconda3/envs/explainaboard_web/lib/python3.10/site-packages/datalabs/utils/file_utils.py", line 344, in cached_path
    output_path = get_from_cache(
  File "/Users/gneubig/opt/anaconda3/envs/explainaboard_web/lib/python3.10/site-packages/datalabs/utils/file_utils.py", line 717, in get_from_cache
    raise FileNotFoundError(f"Couldn't find file at {url}")
FileNotFoundError: Couldn't find file at http://cdatalab1.oss-cn-beijing.aliyuncs.com/question_answering/ccks2018task4/test2.txt