其他数据集训练出错

Yuknoshita commented 3 years ago

我按照您对数据的处理（nytAndWebnlg），应用在其他数据集上：我猜测您dataloader之中 ` subj = entity[subj_idx]

        obj = entity[obj_idx]

        rc_head_labels+=[subj['start'], obj['start'], re['type']]

        rc_tail_labels+=[subj['end']-1, obj['end']-1, re['type']]

的含义是实体变成[1, 1, 'None', 16, 20, 'None']，两个数字是实体的单词起始和结束下标，None是类型关系是：rc_head_labels = [1, 1, '/location/location/contains]，rc_tail_labels= [16, 20, '/location/location/contains]`，即头实体和尾实体的单词下标对和关系类型。

我想请教的是：

我对于数据的处理是否理解正确？
实体的类型为None或者实际的类型有什么区别吗？
在训练中，进行forward()的时候，出现了维度不匹配的问题,这是什么原因呢？应当如何解决？

Traceback (most recent call last): File "B:\work\pycharm\PYCHARM\PyCharm Community Edition 2020.3.4\plugins\python-ce\helpers\pydev\pydevd.py", line 1483, in _exec pydev_imports.execfile(file, globals, locals) # execute the script File "B:\work\pycharm\PYCHARM\PyCharm Community Edition 2020.3.4\plugins\python-ce\helpers\pydev_pydev_imps_pydev_execfile.py", line 18, in execfile exec(compile(contents+"\n", file, 'exec'), glob, loc) File "B:/model/PFN-nested/main.py", line 197, in ner_pred, re_head_pred, re_tail_pred = model(text, mask) File "B:\work\anaconda\envs\pfn\lib\site-packages\torch\nn\modules\module.py", line 1051, in _call_impl return forward_call(input, kwargs) File "B:\model\PFN-nested\model\pfn.py", line 260, in forward x = self.bert(x)[0] File "B:\work\anaconda\envs\pfn\lib\site-packages\torch\nn\modules\module.py", line 1051, in _call_impl return forward_call(input, *kwargs) File "B:\work\anaconda\envs\pfn\lib\site-packages\transformers\models\bert\modeling_bert.py", line 989, in forward past_key_values_length=past_key_values_length, File "B:\work\anaconda\envs\pfn\lib\site-packages\torch\nn\modules\module.py", line 1051, in _call_impl return forward_call(input, **kwargs) File "B:\work\anaconda\envs\pfn\lib\site-packages\transformers\models\bert\modeling_bert.py", line 221, in forward embeddings += position_embeddings RuntimeError: The size of tensor a (588) must match the size of tensor b (512) at non-singleton dimension 1

Process finished with exit code 1

Coopercoppers commented 3 years ago

rc_head_label应该是subj_head, obj_head, relation。你这上边写的1，1是subj_head, subj_tail，这是不对的。
不需要对实体进行类型划分的就设成None
这个可能是你的句子tokenize之后长度超过了512，检查下你的数据，看看哪个超了。

Yuknoshita commented 3 years ago

好的，感谢您的指导

------------------ 原始邮件 ------------------ 发件人: "Coopercoppers/PFN" @.>; 发送时间: 2021年10月30日(星期六) 下午4:02 @.>; @.**@.>; 主题: Re: [Coopercoppers/PFN] 其他数据集训练出错 (Issue #5)

rc_head_label应该是subj_head, obj_head, relation。你这上边写的1，1是subj_head, subj_tail，这是不对的。

不需要对实体进行类型划分的就设成None

这个可能是你的句子tokenize之后长度超过了512，检查下你的数据，看看哪个超了。

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub, or unsubscribe. Triage notifications on the go with GitHub Mobile for iOS or Android.

Yuknoshita commented 3 years ago

不好意思再次打扰您。我按照您说的检查出错的句子的长度，发现我一个批次的句子都没有超过128的，但是tokenizer(x)后num_tokens=588，确实大于512. 我不是很明白tokenizer的原理，好像我将批次降低之后他仍然会超过。这个bug改了一下午了，很是苦恼，烦请您指导一下该从哪里改，谢谢

------------------ 原始邮件 ------------------ 发件人: "Coopercoppers/PFN" @.>; 发送时间: 2021年10月30日(星期六) 下午4:24 @.>; @.**@.>; 主题: Re: [Coopercoppers/PFN] 其他数据集训练出错 (Issue #5)

Closed #5.

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub, or unsubscribe. Triage notifications on the go with GitHub Mobile for iOS or Android.

Coopercoppers commented 3 years ago

你把tokenizer后长度为588的那个句子拿出来打印一下再分析。

Yuknoshita commented 3 years ago

这个我之前已经试过了，很奇怪的是，我的tokenizer()是对一个字符list进行的，最后得到了一个一维的输出，就是这个588的向量，在我的理解中，tokenizer应该是对每个句子进行的吧。。。可能之前的数据处理还是有问题，谢谢您的回复

------------------ 原始邮件 ------------------ 发件人: "Coopercoppers/PFN" @.>; 发送时间: 2021年10月30日(星期六) 晚上6:07 @.>; @.**@.>; 主题: Re: [Coopercoppers/PFN] 其他数据集训练出错 (Issue #5)

你把tokenizer后长度为588的那个句子拿出来打印一下再分析。

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub, or unsubscribe. Triage notifications on the go with GitHub Mobile for iOS or Android.

Coopercoppers / PFN

其他数据集训练出错 #5