Open mfaisal opened 5 years ago
ERROR - transformers.configuration_utils - Model name 'roberta-base-uncased' was not found in model name list (bert-base-uncased, bert-large-uncased, bert-base-cased, bert-large-cased, bert-base-multilingual-uncased, bert-base-multilingual-cased, bert-base-chinese, bert-base-german-cased, bert-large-uncased-whole-word-masking, bert-large-cased-whole-word-masking, bert-large-uncased-whole-word-masking-finetuned-squad, bert-large-cased-whole-word-masking-finetuned-squad, bert-base-cased-finetuned-mrpc). We assumed 'roberta-base-uncased' was a path or url but couldn't find any file associated to this path or url.
I am trying to use XLNet base (xlnet-base-cased), but I get an error when creating the learner:
learner = BertLearner.from_pretrained_model(
    databunch,
    args.model_name,
    metrics=metrics,
    device=device,
    logger=logger,
    output_dir=args.output_dir,
    finetuned_wgts_path=None,
    warmup_steps=10,
    multi_gpu=args.multi_gpu,
    is_fp16=True,
    multi_label=True,
    logging_steps=0,
)
10/18/2019 00:36:04 - INFO - transformers.file_utils - https://s3.amazonaws.com/models.huggingface.co/bert/xlnet-base-cased-config.json not found in cache or force_download set to True, downloading to /tmp/tmprecagwg5
100%|██████████| 641/641 [00:00<00:00, 177099.59B/s]
10/18/2019 00:36:04 - INFO - transformers.file_utils - copying /tmp/tmprecagwg5 to cache at /root/.cache/torch/transformers/c9cc6e53904f7f3679a31ec4af244f4419e25ebc8e71ebf8c558a31cbcf07fc8.ef1824921bc0786e97dc88d55eb17aabf18aac90f24bd34c0650529e7ba27d6f
10/18/2019 00:36:04 - INFO - transformers.file_utils - creating metadata file for /root/.cache/torch/transformers/c9cc6e53904f7f3679a31ec4af244f4419e25ebc8e71ebf8c558a31cbcf07fc8.ef1824921bc0786e97dc88d55eb17aabf18aac90f24bd34c0650529e7ba27d6f
10/18/2019 00:36:04 - INFO - transformers.file_utils - removing temp file /tmp/tmprecagwg5
10/18/2019 00:36:04 - INFO - transformers.configuration_utils - loading configuration file https://s3.amazonaws.com/models.huggingface.co/bert/xlnet-base-cased-config.json from cache at /root/.cache/torch/transformers/c9cc6e53904f7f3679a31ec4af244f4419e25ebc8e71ebf8c558a31cbcf07fc8.ef1824921bc0786e97dc88d55eb17aabf18aac90f24bd34c0650529e7ba27d6f
10/18/2019 00:36:04 - INFO - transformers.configuration_utils - Model config { "attn_type": "bi", "bi_data": false, "clamp_len": -1, "d_head": 64, "d_inner": 3072, "d_model": 768, "dropout": 0.1, "end_n_top": 5, "ff_activation": "gelu", "finetuning_task": null, "initializer_range": 0.02, "layer_norm_eps": 1e-12, "mem_len": null, "n_head": 12, "n_layer": 12, "n_token": 32000, "num_labels": 13, "output_attentions": false, "output_hidden_states": false, "pruned_heads": {}, "reuse_len": null, "same_length": false, "start_n_top": 5, "summary_activation": "tanh", "summary_last_dropout": 0.1, "summary_type": "last", "summary_use_proj": true, "torchscript": false, "untie_r": true, "use_bfloat16": false }
10/18/2019 00:36:04 - INFO - transformers.file_utils - https://s3.amazonaws.com/models.huggingface.co/bert/xlnet-base-cased-pytorch_model.bin not found in cache or force_download set to True, downloading to /tmp/tmpgq8bybxi
100%|██████████| 467042463/467042463 [00:35<00:00, 13098175.50B/s]
10/18/2019 00:36:40 - INFO - transformers.file_utils - copying /tmp/tmpgq8bybxi to cache at /root/.cache/torch/transformers/24197ba0ce5dbfe23924431610704c88e2c0371afa49149360e4c823219ab474.7eac4fe898a021204e63c88c00ea68c60443c57f94b4bc3c02adbde6465745ac
10/18/2019 00:36:42 - INFO - transformers.file_utils - creating metadata file for /root/.cache/torch/transformers/24197ba0ce5dbfe23924431610704c88e2c0371afa49149360e4c823219ab474.7eac4fe898a021204e63c88c00ea68c60443c57f94b4bc3c02adbde6465745ac
10/18/2019 00:36:42 - INFO - transformers.file_utils - removing temp file /tmp/tmpgq8bybxi
10/18/2019 00:36:42 - INFO - transformers.modeling_utils - loading weights file https://s3.amazonaws.com/models.huggingface.co/bert/xlnet-base-cased-pytorch_model.bin from cache at /root/.cache/torch/transformers/24197ba0ce5dbfe23924431610704c88e2c0371afa49149360e4c823219ab474.7eac4fe898a021204e63c88c00ea68c60443c57f94b4bc3c02adbde6465745ac
ImportError Traceback (most recent call last)