showlab / all-in-one

[CVPR2023] All in One: Exploring Unified Video-Language Pre-training
https://arxiv.org/abs/2203.07303
277 stars 16 forks source link

401 Client Error: Unauthorized for url: https://huggingface.co/pretrained/bert-base-uncased/resolve/main/vocab.txt #11

Closed wonzin closed 1 year ago

wonzin commented 1 year ago

$ python run.py with data_root=/datasets/msvd/data num_gpus=2 num_nodes=1 num_frames=3 per_gpu_batchsize=16 task_finetune_msvdqa load_path="pretrained/all-in-one-base.ckpt"

WARNING - root - Changed type of config entry "max_steps" from int to NoneType WARNING - AllInOne - No observers have been added to this run INFO - AllInOne - Running command 'main' INFO - AllInOne - Started Global seed set to 0 INFO - lightning - Global seed set to 0

ERROR - AllInOne - Failed after 0:00:01! Traceback (most recent calls WITHOUT Sacred internals): File "run.py", line 15, in main dm = MTDataModule(_config, dist=True) File "/home/miniconda3/envs/allinone/lib/python3.7/site-packages/pytorch_lightning/core/datamodule.py", line 49, in call obj = type.call(cls, *args, kwargs) File "/home/all-in-one/AllInOne/datamodules/multitask_datamodule.py", line 19, in init self.dm_dicts = {key: _datamoduleskey for key in datamodule_keys} File "/home/all-in-one/AllInOne/datamodules/multitask_datamodule.py", line 19, in self.dm_dicts = {key: _datamoduleskey for key in datamodule_keys} File "/home/miniconda3/envs/allinone/lib/python3.7/site-packages/pytorch_lightning/core/datamodule.py", line 49, in call obj = type.call(cls, *args, *kwargs) File "/home/all-in-one/AllInOne/datamodules/msvdqa_datamodule.py", line 8, in init super().init(args, kwargs) File "/home/all-in-one/AllInOne/datamodules/datamodule_base.py", line 57, in init self.tokenizer = get_pretrained_tokenizer(tokenizer) File "/home/all-in-one/AllInOne/datamodules/datamodule_base.py", line 20, in get_pretrained_tokenizer from_pretrained, do_lower_case="uncased" in from_pretrained File "/home/miniconda3/envs/allinone/lib/python3.7/site-packages/transformers/tokenization_utils_base.py", line 1752, in from_pretrained raise err File "/home/miniconda3/envs/allinone/lib/python3.7/site-packages/transformers/tokenization_utils_base.py", line 1745, in from_pretrained use_auth_token=use_auth_token, File "/home/miniconda3/envs/allinone/lib/python3.7/site-packages/transformers/file_utils.py", line 1056, in cached_path local_files_only=local_files_only, File "/home/miniconda3/envs/allinone/lib/python3.7/site-packages/transformers/file_utils.py", line 1186, in get_from_cache r.raise_for_status() File "/home/miniconda3/envs/allinone/lib/python3.7/site-packages/requests/models.py", line 953, in raise_for_status raise HTTPError(http_error_msg, response=self) requests.exceptions.HTTPError: 401 Client Error: Unauthorized for url: https://huggingface.co/pretrained/bert-base-uncased/resolve/main/vocab.txt


There are 2 reasons for 401 Client Error for hugging face transformer.

  1. The repository does not exist.
  2. The repository is private.

bert base uncased model is definitely not private. image.

How could I resolve this issue?

python == 3.7.13 transformers == 4.2.1

wonzin commented 1 year ago

https://huggingface.co/pretrained/bert-base-uncased/resolve/main/vocab.txt would be https://huggingface.co/bert-base-uncased/resolve/main/vocab.txt

AllInOne/config.py is modified. line 53 tokenizer = "pretrained/bert-base-uncased" tokenizer = "bert-base-uncased"

FingerRec commented 1 year ago

Thanks for the kindly feedback.