ExpressAI / DataLab

The unified platform for data-related resources.
https://expressai.github.io/DataLab/
Apache License 2.0
131 stars 27 forks source link

financial_phrasebank dataset is broken #402

Open neubig opened 1 year ago

neubig commented 1 year ago
Couldn't find a directory or a dataset named 'financial_phrasebank' in this version. It was picked from the master branch on github instead.
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/Users/gneubig/opt/anaconda3/envs/explainaboard_web/lib/python3.10/site-packages/datalabs/load.py", line 2144, in load_dataset
    builder_instance.download_and_prepare(
  File "/Users/gneubig/opt/anaconda3/envs/explainaboard_web/lib/python3.10/site-packages/datalabs/builder.py", line 747, in download_and_prepare
    self._download_and_prepare(
  File "/Users/gneubig/opt/anaconda3/envs/explainaboard_web/lib/python3.10/site-packages/datalabs/builder.py", line 844, in _download_and_prepare
    split_generators = self._split_generators(dl_manager, **split_generators_kwargs)
  File "/Users/gneubig/.cache/expressai/modules/datasets_modules/datalab/financial_phrasebank/ad8d820642ae40831f1ed3357ff9a50592fbe5bf61331a55e88b0072e1ba0d69/financial_phrasebank.py", line 150, in _split_generators
    data_dir = dl_manager.download_and_extract(_URL)
  File "/Users/gneubig/opt/anaconda3/envs/explainaboard_web/lib/python3.10/site-packages/datalabs/utils/download_manager.py", line 322, in download_and_extract
    return self.extract(self.download(url_or_urls))
  File "/Users/gneubig/opt/anaconda3/envs/explainaboard_web/lib/python3.10/site-packages/datalabs/utils/download_manager.py", line 221, in download
    downloaded_path_or_paths = map_nested(
  File "/Users/gneubig/opt/anaconda3/envs/explainaboard_web/lib/python3.10/site-packages/datalabs/utils/py_utils.py", line 297, in map_nested
    return function(data_struct)
  File "/Users/gneubig/opt/anaconda3/envs/explainaboard_web/lib/python3.10/site-packages/datalabs/utils/download_manager.py", line 248, in _download
    return cached_path(url_or_filename, download_config=download_config)
  File "/Users/gneubig/opt/anaconda3/envs/explainaboard_web/lib/python3.10/site-packages/datalabs/utils/file_utils.py", line 344, in cached_path
    output_path = get_from_cache(
  File "/Users/gneubig/opt/anaconda3/envs/explainaboard_web/lib/python3.10/site-packages/datalabs/utils/file_utils.py", line 722, in get_from_cache
    raise ConnectionError(
ConnectionError: Couldn't reach https://www.researchgate.net/profile/Pekka_Malo/publication/251231364_FinancialPhraseBank-v10/data/0c96051eee4fb1d56e000000/FinancialPhraseBank-v10.zip (error 403)