Describe the bug
The following Python code tries to download a private dataset and fails with the error datasets.exceptions.DatasetNotFoundError: Dataset 'ClimatePolicyRadar/all-document-text-data-weekly' doesn't exist on the Hub or cannot be accessed. Downloading a public dataset doesn't work.
from datasets import load_dataset
_ = load_dataset("ClimatePolicyRadar/all-document-text-data-weekly")
This seems to be an issue with my machine's configuration, as the code above works on a colleague's machine. So far I have tried:
logging out and back in with the Hugging Face CLI, using huggingface-cli logout
manually removing the token cache at /Users/kalyan/.cache/huggingface/token (found using huggingface-cli env)
manually passing a token to load_dataset
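Since logging out and deleting the cached token did not help, one more thing worth ruling out is a token coming from somewhere else. The sketch below lists where a token could be picked up from on this machine. It assumes, based on my understanding of huggingface_hub, that the HF_TOKEN and HUGGING_FACE_HUB_TOKEN environment variables take precedence over the token file, and that the file lives at HF_TOKEN_PATH or under HF_HOME (default ~/.cache/huggingface). The helper names are hypothetical, and the token values are never printed.

```python
import os
from pathlib import Path
from typing import List, Mapping

# Assumption: these environment variables override the cached token file.
TOKEN_ENV_VARS = ("HF_TOKEN", "HUGGING_FACE_HUB_TOKEN")


def default_token_path(env: Mapping[str, str]) -> Path:
    """Guess where the token file is stored, given a set of environment variables."""
    if env.get("HF_TOKEN_PATH"):
        return Path(env["HF_TOKEN_PATH"])
    hf_home = env.get("HF_HOME") or str(Path.home() / ".cache" / "huggingface")
    return Path(hf_home).expanduser() / "token"


def find_token_sources(env: Mapping[str, str], token_path: Path) -> List[str]:
    """List every place a token is currently set, without revealing its value."""
    sources = [f"env:{name}" for name in TOKEN_ENV_VARS if env.get(name)]
    if token_path.is_file() and token_path.stat().st_size > 0:
        sources.append(f"file:{token_path}")
    return sources


# Usage on the affected machine:
#   print(find_token_sources(os.environ, default_token_path(os.environ)))
```

If an env: entry shows up here, a stale token exported in a shell profile could explain why logging out and back in has no effect.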
My output of huggingface-cli whoami:
kdutia
orgs: ClimatePolicyRadar
Steps to reproduce the bug
python
Python 3.12.2 (main, Feb 6 2024, 20:19:44) [Clang 15.0.0 (clang-1500.1.0.2.5)] on darwin
Type "help", "copyright", "credits" or "license" for more information.
>>> from datasets import load_dataset
>>> _ = load_dataset("ClimatePolicyRadar/all-document-text-data-weekly")
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/Users/kalyan/Library/Caches/pypoetry/virtualenvs/open-data-cnKQNmjn-py3.12/lib/python3.12/site-packages/datasets/load.py", line 2074, in load_dataset
    builder_instance = load_dataset_builder(
                       ^^^^^^^^^^^^^^^^^^^^^
  File "/Users/kalyan/Library/Caches/pypoetry/virtualenvs/open-data-cnKQNmjn-py3.12/lib/python3.12/site-packages/datasets/load.py", line 1795, in load_dataset_builder
    dataset_module = dataset_module_factory(
                     ^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/kalyan/Library/Caches/pypoetry/virtualenvs/open-data-cnKQNmjn-py3.12/lib/python3.12/site-packages/datasets/load.py", line 1659, in dataset_module_factory
    raise e1 from None
  File "/Users/kalyan/Library/Caches/pypoetry/virtualenvs/open-data-cnKQNmjn-py3.12/lib/python3.12/site-packages/datasets/load.py", line 1597, in dataset_module_factory
    raise DatasetNotFoundError(f"Dataset '{path}' doesn't exist on the Hub or cannot be accessed.") from e
datasets.exceptions.DatasetNotFoundError: Dataset 'ClimatePolicyRadar/all-document-text-data-weekly' doesn't exist on the Hub or cannot be accessed.
>>>
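The DatasetNotFoundError above is raised with "from e", so it wraps an underlying error from the Hub. A possible way to see that error directly is to ask the Hub for the dataset metadata, bypassing datasets. This is only a sketch: check_dataset_access is a hypothetical helper, it assumes HfApi.dataset_info from huggingface_hub is available in the environment, and it assumes that with token=None the locally stored token is used.

```python
from typing import Callable, Optional


def check_dataset_access(
    repo_id: str,
    token: Optional[str] = None,
    fetch_info: Optional[Callable[..., object]] = None,
) -> str:
    """Return "ok" if metadata for repo_id can be fetched, else the error class name."""
    if fetch_info is None:
        # Imported lazily so the helper can be exercised without the library or network.
        from huggingface_hub import HfApi

        fetch_info = HfApi().dataset_info
    try:
        fetch_info(repo_id, token=token)
    except Exception as err:
        # For a private repo the token cannot read, I would expect something like
        # RepositoryNotFoundError here, but that is an assumption.
        return type(err).__name__
    return "ok"


# Usage on the affected machine:
#   print(check_dataset_access("ClimatePolicyRadar/all-document-text-data-weekly"))
```

Comparing the result with and without an explicit token argument, and against a colleague's machine, may help narrow down whether the token or the network configuration is at fault.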
Expected behavior
The dataset downloads successfully.
Environment info
From huggingface-cli env:
From datasets-cli env: