Closed padeoe closed 8 months ago
Through debugging, I found a potential solution is to modify the code in the error handling module of huggingface_hub
: https://github.com/huggingface/huggingface_hub/commit/56d6c798c44e83d2a3167e74c022737d8fcbe822
@Wauplin
Thanks for investigating and reporting the bug @padeoe! I've opened a PR in huggingface_hub
with your suggested fix! :) https://github.com/huggingface/huggingface_hub/pull/2119
Describe the bug
This bug is triggered under the following conditions:
bookcorpus
,gsm8k
,wikipedia
, rather than in the form ofA/B
.HF_ENDPOINT
is set and the hostname is not in the form of(hub-ci.)?huggingface.co
.datasets>2.15.0
orhuggingface-hub>0.19.4
. For example, using the latest versions:datasets==2.18.0
andhuggingface-hub==0.21.4
,Steps to reproduce the bug
the issue can be reproduced with the following code:
console output:
Expected behavior
The dataset was downloaded correctly without any errors.
Environment info
datasets==2.18.0 huggingface-hub==0.21.4