huggingface / datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
https://huggingface.co/docs/datasets
Apache License 2.0

404 Client Error: Not Found for url: https://huggingface.co/api/models/bert-large-cased #5832

Closed: varungupta31 closed this issue 1 year ago

varungupta31 commented 1 year ago

Describe the bug

Loading the tokenizer for the bert-large-cased model raises an HTTPError, with the following traceback:

HTTPError                                 Traceback (most recent call last)
<ipython-input-6-5c580443a1ad> in <module>
----> 1 tokenizer = BertTokenizer.from_pretrained('bert-large-cased')

~/miniconda3/envs/cmd-chall/lib/python3.7/site-packages/transformers/tokenization_utils_base.py in from_pretrained(cls, pretrained_model_name_or_path, *init_inputs, **kwargs)
   1646             # At this point pretrained_model_name_or_path is either a directory or a model identifier name
   1647             fast_tokenizer_file = get_fast_tokenizer_file(
-> 1648                 pretrained_model_name_or_path, revision=revision, use_auth_token=use_auth_token
   1649             )
   1650             additional_files_names = {

~/miniconda3/envs/cmd-chall/lib/python3.7/site-packages/transformers/tokenization_utils_base.py in get_fast_tokenizer_file(path_or_repo, revision, use_auth_token)
   3406     """
   3407     # Inspect all files from the repo/folder.
-> 3408     all_files = get_list_of_files(path_or_repo, revision=revision, use_auth_token=use_auth_token)
   3409     tokenizer_files_map = {}
   3410     for file_name in all_files:

~/miniconda3/envs/cmd-chall/lib/python3.7/site-packages/transformers/file_utils.py in get_list_of_files(path_or_repo, revision, use_auth_token)
   1685         token = None
   1686     model_info = HfApi(endpoint=HUGGINGFACE_CO_RESOLVE_ENDPOINT).model_info(
-> 1687         path_or_repo, revision=revision, token=token
   1688     )
   1689     return [f.rfilename for f in model_info.siblings]

~/miniconda3/envs/cmd-chall/lib/python3.7/site-packages/huggingface_hub/hf_api.py in model_info(self, repo_id, revision, token)
    246         )
    247         r = requests.get(path, headers=headers)
--> 248         r.raise_for_status()
    249         d = r.json()
    250         return ModelInfo(**d)

~/miniconda3/envs/cmd-chall/lib/python3.7/site-packages/requests/models.py in raise_for_status(self)
    951 
    952         if http_error_msg:
--> 953             raise HTTPError(http_error_msg, response=self)
    954 
    955     def close(self):

HTTPError: 404 Client Error: Not Found for url: https://huggingface.co/api/models/bert-large-cased
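
The last frames show that the 404 comes from a plain GET on the Hub's model-info endpoint, so the same request can be checked outside transformers. Below is a minimal sketch, assuming the URL pattern seen in the error message (endpoint, then /api/models/, then the repo id). The helper names model_info_url and probe_status are hypothetical and are not part of huggingface_hub.

```python
import urllib.error
import urllib.request

HUB_ENDPOINT = "https://huggingface.co"  # host taken from the URL in the error message


def model_info_url(repo_id, endpoint=HUB_ENDPOINT):
    """Build the model-info URL in the form shown in the traceback (assumed pattern)."""
    return f"{endpoint.rstrip('/')}/api/models/{repo_id}"


def probe_status(url, timeout=10.0):
    """Return the HTTP status code for a GET on `url` (needs network access)."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # urllib raises on 4xx/5xx responses; the status code is carried on the exception.
        return err.code
```

Calling probe_status(model_info_url("bert-large-cased")) from the affected machine would show whether the endpoint itself answers 404 or whether the failure is specific to the installed library versions.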

I have also tried running in offline mode, as discussed here, with the following environment variables set:

HF_DATASETS_OFFLINE=1 
TRANSFORMERS_OFFLINE=1
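
For reference, the same two variables can also be set from Python. This is a minimal sketch, assuming they are set before transformers or datasets is imported (to my understanding both libraries read them at import time) and that the tokenizer files are already in the local cache, since offline mode cannot fetch files that were never downloaded. The helper name enable_offline_mode is hypothetical.

```python
import os

# The two variable names used in this report.
OFFLINE_FLAGS = ("HF_DATASETS_OFFLINE", "TRANSFORMERS_OFFLINE")


def enable_offline_mode(environ=os.environ):
    """Set both offline flags to "1" and return their resulting values."""
    for name in OFFLINE_FLAGS:
        environ[name] = "1"
    return {name: environ[name] for name in OFFLINE_FLAGS}


# Call this before importing transformers or datasets.
flags = enable_offline_mode()
```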

Steps to reproduce the bug

  1. from transformers import BertTokenizer, BertModel
  2. tokenizer = BertTokenizer.from_pretrained('bert-large-cased')
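
A self-contained variant of these steps that also records the library versions involved may make the report easier to compare across environments. This is a sketch, assuming Python 3.8+ for importlib.metadata (the 3.7 environment listed below would need the importlib_metadata backport instead). The names installed_versions and reproduce are hypothetical helpers.

```python
from importlib import metadata


def installed_versions(packages):
    """Map each distribution name to its installed version, or None if absent."""
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


def reproduce(model_name="bert-large-cased"):
    """Run the tokenizer load from the report; needs transformers and network access."""
    # Imported lazily so installed_versions() works even without transformers.
    from transformers import BertTokenizer

    return BertTokenizer.from_pretrained(model_name)


print(installed_versions(["transformers", "huggingface-hub", "tokenizers", "requests"]))
```

Calling reproduce() afterwards runs the failing load, so the printed versions and the traceback end up in the same log.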

Expected behavior

The tokenizer should load without raising an HTTPError.

Environment info

# Name Version Build Channel
_libgcc_mutex 0.1 main
_openmp_mutex 4.5 1_gnu
_pytorch_select 0.1 cpu_0
appdirs 1.4.4 pypi_0 pypi
backcall 0.2.0 pypi_0 pypi
blas 1.0 mkl
bzip2 1.0.8 h7b6447c_0
ca-certificates 2021.7.5 h06a4308_1
certifi 2021.5.30 py37h06a4308_0
cffi 1.14.6 py37h400218f_0
charset-normalizer 2.0.3 pypi_0 pypi
click 8.0.1 pypi_0 pypi
colorama 0.4.4 pypi_0 pypi
cudatoolkit 11.1.74 h6bb024c_0 nvidia
cycler 0.11.0 pypi_0 pypi
decorator 5.0.9 pypi_0 pypi
docker-pycreds 0.4.0 pypi_0 pypi
docopt 0.6.2 pypi_0 pypi
dominate 2.6.0 pypi_0 pypi
ffmpeg 4.3 hf484d3e_0 pytorch
filelock 3.0.12 pypi_0 pypi
fonttools 4.38.0 pypi_0 pypi
freetype 2.10.4 h5ab3b9f_0
gitdb 4.0.7 pypi_0 pypi
gitpython 3.1.18 pypi_0 pypi
gmp 6.2.1 h2531618_2
gnutls 3.6.15 he1e5248_0
huggingface-hub 0.0.12 pypi_0 pypi
humanize 3.10.0 pypi_0 pypi
idna 3.2 pypi_0 pypi
importlib-metadata 4.6.1 pypi_0 pypi
intel-openmp 2019.4 243
ipdb 0.13.9 pypi_0 pypi
ipython 7.25.0 pypi_0 pypi
ipython-genutils 0.2.0 pypi_0 pypi
jedi 0.18.0 pypi_0 pypi
joblib 1.0.1 pypi_0 pypi
jpeg 9b h024ee3a_2
jsonpickle 1.5.2 pypi_0 pypi
kiwisolver 1.4.4 pypi_0 pypi
lame 3.100 h7b6447c_0
lcms2 2.12 h3be6417_0
ld_impl_linux-64 2.35.1 h7274673_9
libffi 3.3 he6710b0_2
libgcc-ng 9.3.0 h5101ec6_17
libgomp 9.3.0 h5101ec6_17
libiconv 1.15 h63c8f33_5
libidn2 2.3.2 h7f8727e_0
libmklml 2019.0.5 0
libpng 1.6.37 hbc83047_0
libstdcxx-ng 9.3.0 hd4cf53a_17
libtasn1 4.16.0 h27cfd23_0
libtiff 4.2.0 h85742a9_0
libunistring 0.9.10 h27cfd23_0
libuv 1.40.0 h7b6447c_0
libwebp-base 1.2.0 h27cfd23_0
lz4-c 1.9.3 h2531618_0
matplotlib 3.5.3 pypi_0 pypi
matplotlib-inline 0.1.2 pypi_0 pypi
mergedeep 1.3.4 pypi_0 pypi
mkl 2020.2 256
mkl-service 2.3.0 py37he8ac12f_0
mkl_fft 1.3.0 py37h54f3939_0
mkl_random 1.1.1 py37h0573a6f_0
msgpack 1.0.2 pypi_0 pypi
munch 2.5.0 pypi_0 pypi
ncurses 6.2 he6710b0_1
nettle 3.7.3 hbbd107a_1
ninja 1.10.2 hff7bd54_1
nltk 3.8.1 pypi_0 pypi
numpy 1.19.2 py37h54aff64_0
numpy-base 1.19.2 py37hfa32c7d_0
olefile 0.46 py37_0
openh264 2.1.0 hd408876_0
openjpeg 2.3.0 h05c96fa_1
openssl 1.1.1k h27cfd23_0
packaging 21.0 pypi_0 pypi
pandas 1.3.1 pypi_0 pypi
parso 0.8.2 pypi_0 pypi
pathtools 0.1.2 pypi_0 pypi
pexpect 4.8.0 pypi_0 pypi
pickleshare 0.7.5 pypi_0 pypi
pillow 8.3.1 py37h2c7a002_0
pip 21.1.3 py37h06a4308_0
prompt-toolkit 3.0.19 pypi_0 pypi
protobuf 4.21.12 pypi_0 pypi
psutil 5.8.0 pypi_0 pypi
ptyprocess 0.7.0 pypi_0 pypi
py-cpuinfo 8.0.0 pypi_0 pypi
pycparser 2.20 py_2
pygments 2.9.0 pypi_0 pypi
pyparsing 2.4.7 pypi_0 pypi
python 3.7.10 h12debd9_4
python-dateutil 2.8.2 pypi_0 pypi
pytorch 1.9.0 py3.7_cuda11.1_cudnn8.0.5_0 pytorch
pytz 2021.1 pypi_0 pypi
pyyaml 5.4.1 pypi_0 pypi
readline 8.1 h27cfd23_0
regex 2022.10.31 pypi_0 pypi
requests 2.26.0 pypi_0 pypi
sacred 0.8.2 pypi_0 pypi
sacremoses 0.0.45 pypi_0 pypi
scikit-learn 0.24.2 pypi_0 pypi
scipy 1.7.0 pypi_0 pypi
sentry-sdk 1.15.0 pypi_0 pypi
setproctitle 1.3.2 pypi_0 pypi
setuptools 52.0.0 py37h06a4308_0
six 1.16.0 pyhd3eb1b0_0
smmap 4.0.0 pypi_0 pypi
sqlite 3.36.0 hc218d9a_0
threadpoolctl 2.2.0 pypi_0 pypi
tk 8.6.10 hbc83047_0
tokenizers 0.10.3 pypi_0 pypi
toml 0.10.2 pypi_0 pypi
torchaudio 0.9.0 py37 pytorch
torchvision 0.10.0 py37_cu111 pytorch
tqdm 4.61.2 pypi_0 pypi
traitlets 5.0.5 pypi_0 pypi
transformers 4.9.1 pypi_0 pypi
typing-extensions 3.10.0.0 hd3eb1b0_0
typing_extensions 3.10.0.0 pyh06a4308_0
urllib3 1.26.14 pypi_0 pypi
wandb 0.13.10 pypi_0 pypi
wcwidth 0.2.5 pypi_0 pypi
wheel 0.36.2 pyhd3eb1b0_0
wrapt 1.12.1 pypi_0 pypi
xz 5.2.5 h7b6447c_0
zipp 3.5.0 pypi_0 pypi
zlib 1.2.11 h7b6447c_3
zstd 1.4.9 haebb681_0

varungupta31 commented 1 year ago

Moved to https://github.com/huggingface/transformers/issues/23233