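How does the model extract head and tail entities? Is the interaction just the pre-trained model's attention matrix? And how do I save the best model? The results I get differ greatly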

wtangdev / UniRel

released code for our EMNLP22 paper: UniRel: Unified Representation and Interaction for Joint Relational Triple Extraction

Apache License 2.0

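How does the model extract head and tail entities? Is the interaction just the pre-trained model's attention matrix? And how do I save the best model? The results I get differ greatly #16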

Closed yxy0102 closed 6 months ago

yxy0102 commented 9 months ago

wtangdev commented 9 months ago

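Hello. The checkpoint with the best result on the dev set is saved at https://github.com/wtangdev/UniRel/blob/1dbc2e72d0ed1a8eeb418883dd8dbb0cd797876d/run.py#L340C17-L340C32.

If the results are poor, please confirm that you are using the correct dataset: test_data_type = unirel_span corresponds to the nyt dataset (not nyt_star).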
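For readers who want to see the idea in isolation, keeping only the best dev-set checkpoint can be sketched as below. This is a minimal illustration and not the code in run.py: the function name `save_if_best`, the `best.json` file, and the `f1` metric key are all hypothetical, and a real run would save model weights rather than a small JSON state.

```python
import json
import tempfile
from pathlib import Path


def save_if_best(metrics, state, ckpt_dir, key="f1"):
    """Persist `state` only when metrics[key] beats the best score saved so far.

    Hypothetical helper for illustration; not part of UniRel.
    """
    ckpt_dir = Path(ckpt_dir)
    ckpt_dir.mkdir(parents=True, exist_ok=True)
    best_file = ckpt_dir / "best.json"

    # Read the previous best score, if a checkpoint already exists.
    best_so_far = float("-inf")
    if best_file.exists():
        best_so_far = json.loads(best_file.read_text())["score"]

    if metrics[key] <= best_so_far:
        return False  # not an improvement, keep the old checkpoint

    best_file.write_text(json.dumps({"score": metrics[key], "state": state}))
    return True


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        # Made-up dev scores for three evaluation rounds.
        for epoch, dev_f1 in enumerate([0.81, 0.79, 0.86]):
            saved = save_if_best({"f1": dev_f1}, {"epoch": epoch}, tmp)
            print(epoch, dev_f1, "saved" if saved else "skipped")
```

Comparing against the score stored on disk, rather than one held in memory, means a resumed run does not overwrite a better checkpoint from an earlier session.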

yxy0102 commented 9 months ago

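Thank you very much for your reply. Which values can test_data_type take, and what does each of them mean? I noticed that the output differs depending on which one I run with.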

lljhaha13 commented 8 months ago

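I downloaded the WebNLG data as instructed and started training, but got the following error:

embeddings = inputs_embeds + token_type_embeddings
RuntimeError: The size of tensor a (334) must match the size of tensor b (322) at non-singleton dimension 1

Why does this happen? I am using the dataset provided in the README.md.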

wtangdev commented 7 months ago

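Sorry for the very late reply.

In run.py, line 30:

DataProcessorDict = { "nyt_all_sa": UniRelDataProcessor, "unirel_span": UniRelDataProcessor }

As you can see, there are only two options, nyt_all_sa and unirel_span. nyt_all_sa is for the starred datasets (NYT*/WebNLG*, e.g. nyt_star), and unirel_span is for NYT/WebNLG. (Apologies, too, that the naming is easy to misread.)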
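A small guard can catch a mismatched dataset and data type before training starts. The sketch below rests on assumptions: the mapping is inferred from this thread (unirel_span for nyt, not nyt_star), the dataset name `webnlg_star` is a guess at the starred WebNLG identifier, and `check_data_type` is a hypothetical helper that does not exist in the repository. Verify the mapping against run.py before relying on it.

```python
# Assumed mapping, inferred from this thread; verify it against run.py.
# "webnlg_star" is an assumed name for the WebNLG* dataset.
EXPECTED_DATA_TYPE = {
    "nyt": "unirel_span",
    "webnlg": "unirel_span",
    "nyt_star": "nyt_all_sa",
    "webnlg_star": "nyt_all_sa",
}


def check_data_type(dataset_name, test_data_type):
    """Raise ValueError if the data type does not match the dataset (hypothetical helper)."""
    expected = EXPECTED_DATA_TYPE.get(dataset_name)
    if expected is None:
        raise ValueError(f"unknown dataset name: {dataset_name!r}")
    if test_data_type != expected:
        raise ValueError(
            f"dataset {dataset_name!r} expects test_data_type={expected!r}, "
            f"got {test_data_type!r}"
        )
    return test_data_type


if __name__ == "__main__":
    print(check_data_type("nyt", "unirel_span"))
    try:
        check_data_type("nyt_star", "unirel_span")
    except ValueError as err:
        print("rejected:", err)
```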

wtangdev commented 7 months ago

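In reply to lljhaha13, who wrote: "I downloaded the WebNLG data as instructed and started training, but got the following error: embeddings = inputs_embeds + token_type_embeddings RuntimeError: The size of tensor a (334) must match the size of tensor b (322) at non-singleton dimension 1. Why does this happen? I am using the dataset provided in the README.md."

Please check whether the WebNLG and WebNLG* datasets were mixed up.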
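The failing line adds the input embeddings to the token-type embeddings, so the error most likely means input_ids and token_type_ids had different lengths (334 vs 322) for the same example. A quick way to confirm this before the model is called is to scan the encoded examples for length mismatches. The sketch below assumes each example is a dict of plain lists; `find_length_mismatches` is a hypothetical helper, not part of the repository.

```python
def find_length_mismatches(
    features, keys=("input_ids", "token_type_ids", "attention_mask")
):
    """Return (index, {key: length}) for each example whose sequences differ in length.

    Hypothetical helper; assumes each example is a dict mapping names to lists.
    """
    mismatches = []
    for index, feature in enumerate(features):
        # Only compare the keys that are actually present in this example.
        lengths = {key: len(feature[key]) for key in keys if key in feature}
        if len(set(lengths.values())) > 1:
            mismatches.append((index, lengths))
    return mismatches


if __name__ == "__main__":
    # Toy examples reproducing the sizes from the error message.
    examples = [
        {"input_ids": [0] * 334, "token_type_ids": [0] * 322},
        {"input_ids": [0] * 10, "token_type_ids": [0] * 10},
    ]
    for index, lengths in find_length_mismatches(examples):
        print(f"example {index}: {lengths}")
```

If every example is off by the same amount, the cause is more likely a configuration mismatch (such as the dataset variant) than a single bad record.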