WuHuRestaurant / xf_event_extraction2020Top1

科大讯飞2020事件抽取挑战赛第一名解决方案&完整事件抽取系统
536 stars 123 forks source link

ERNIE1.0模型 #17

Closed VonnOoO closed 3 years ago

VonnOoO commented 3 years ago

test.sh运行后读取ERNIE模型,我在https://github.com/nghuyong/ERNIE-Pytorch下载了ernie-1.0 (Chinese)模型,并将解压到文件放到bert/torch_ernie_1/下面,运行还是报错。请问你是在哪里下载的ERNIE模型,并做了什么处理呢?谢谢

WuHuRestaurant commented 3 years ago

Ernie文件夹目录看下

VonnOoO commented 3 years ago

xf_master bert |---torch_roberta_wwm |---torch_ernie_1 |-------config.json |-------pytorch_model.bin |-------bocab.txt

WuHuRestaurant commented 3 years ago

不应该是vocab.txt吗?还有你报的是啥错。。

VonnOoO commented 3 years ago

vocab.txt。报错如下: 02/04/2021 08:39:59 - INFO - transformers.tokenization_utils - Model name '../bert/torch_ernie_1' not found in model shortcut name list (bert-base-uncased, bert-large-uncased, bert-base-cased, bert-large-cased, bert-base-multilingual-uncased, bert-base-multilingual-cased, bert-base-chinese, bert-base-german-cased, bert-large-uncased-whole-word-masking, bert-large-cased-whole-word-masking, bert-large-uncased-whole-word-masking-finetuned-squad, bert-large-cased-whole-word-masking-finetuned-squad, bert-base-cased-finetuned-mrpc, bert-base-german-dbmdz-cased, bert-base-german-dbmdz-uncased, bert-base-finnish-cased-v1, bert-base-finnish-uncased-v1, bert-base-dutch-cased). Assuming '../bert/torch_ernie_1' is a path, a model identifier, or url to a directory containing tokenizer files. 02/04/2021 08:39:59 - INFO - transformers.tokenization_utils - Didn't find file ../bert/torch_ernie_1/added_tokens.json. We won't load it. 02/04/2021 08:39:59 - INFO - transformers.tokenization_utils - Didn't find file ../bert/torch_ernie_1/special_tokens_map.json. We won't load it. 02/04/2021 08:39:59 - INFO - transformers.tokenization_utils - Didn't find file ../bert/torch_ernie_1/tokenizer_config.json. We won't load it. 02/04/2021 08:39:59 - INFO - transformers.tokenization_utils - loading file ../bert/torch_ernie_1/vocab.txt 02/04/2021 08:39:59 - INFO - transformers.tokenization_utils - loading file None 02/04/2021 08:39:59 - INFO - transformers.tokenization_utils - loading file None 02/04/2021 08:39:59 - INFO - transformers.tokenization_utils - loading file None 02/04/2021 08:39:59 - INFO - transformers.configuration_utils - loading configuration file ../bert/torch_ernie_1/config.json 02/04/2021 08:39:59 - INFO - transformers.configuration_utils - Model config BertConfig { "attention_probs_dropout_prob": 0.1, "hidden_act": "relu", "hidden_dropout_prob": 0.1, "hidden_size": 768, "initializer_range": 0.02, "intermediate_size": 3072, "layer_norm_eps": 1e-05, "max_position_embeddings": 513, "model_type": "bert", "num_attention_heads": 12, "num_hidden_layers": 12, "pad_token_id": 0, "type_vocab_size": 2, "vocab_size": 18000 }

02/04/2021 08:39:59 - INFO - transformers.modeling_utils - loading weights file ../bert/torch_ernie_1/pytorch_model.bin Traceback (most recent call last): File "test.py", line 291, in ensemble_attribution(args.version) File "/home/fk/event_extraction_exp/xf/ensemble_predict.py", line 144, in ensemble_attribution all_polarity_logits /= float(count) TypeError: unsupported operand type(s) for /=: 'NoneType' and 'float'

WuHuRestaurant commented 3 years ago

你模型训练了吗。。上来就test

VonnOoO commented 3 years ago

4个模型训练了的,也改了模型的路径。这个执行了前面的抽取,然后执行到了ensemble attribution时,加载ernie后报错,我换过几个ernie都报错,就是不确定是不是这个的问题

WuHuRestaurant commented 3 years ago

这个代码的原因是执行 ensemble_attribution 时126行 ernie_models_path 没有读取到对应的模型路径,你可以打印出来 debug 看一下

VonnOoO commented 3 years ago

请问你的ernie是在哪儿下载的呢,目录结果是一样的?

WuHuRestaurant commented 3 years ago

一样的地方,你ernie能训练为啥加载的时候有问题

VonnOoO commented 3 years ago

谢谢提醒,我试试先换用这个ernie训练attribution模型

ztclear commented 3 years ago

xf_master bert |---torch_roberta_wwm |---torch_ernie_1 |-------config.json |-------pytorch_model.bin |-------bocab.txt

请问这个torch_roberta_wwm在哪里下载 我也遇到了 Model name './bert/torch_roberta_wwm' was not found in tokenizers model name list (bert-base-uncased,。。。 这个问题,请问你解决了嘛

WuHuRestaurant commented 3 years ago

xf_master bert |---torch_roberta_wwm |---torch_ernie_1 |-------config.json |-------pytorch_model.bin |-------bocab.txt

请问这个torch_roberta_wwm在哪里下载 我也遇到了 Model name './bert/torch_roberta_wwm' was not found in tokenizers model name list (bert-base-uncased,。。。 这个问题,请问你解决了嘛

wwm这个模型的文件夹里面要有 vocab.txt 才行,有Bert的词表才能创建 tokenizer