nju-websoft / MBE

Inductive Knowledge Graph Reasoning for Multi-batch Emerging Entities, CIKM 2022
GNU General Public License v3.0

Entities in validation sets #2

Closed by chjjma 1 year ago

chjjma commented 1 year ago

Hi.

First, thanks for the code. In the dataset section of the paper, one of the cleaning constraints is that the set of entities in the validation set is a subset of the set of entities in the training set. However, this constraint does not seem to hold for the released datasets. In WN-MBE, entity '1828267' is in the validation set but not in the training set. In NELL-MBE, the same is true of entity 'concept_athlete_jon_kitna', and in FB-MBE, of entity '/m/01pf6'. It seems that the statistics provided in the paper were computed before cleaning. Could you provide the statistics after cleaning? Thanks in advance.

yncui-nju commented 1 year ago

Hi,

Thanks for your interest in our work! We have updated the dataset. There are 7636, 31406, and 18501 validation facts in WN-MBE, FB-MBE, and NELL-MBE, respectively.

chjjma commented 1 year ago

Thanks for the update! But I think there are still some entities that appear in the validation set but not in the training set. For example, /m/05_p2 appears in FB-MBE's valid.triples but not in base_train.triples. Thanks in advance.
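
For anyone who wants to reproduce this check, here is a minimal sketch. It assumes each line of a `.triples` file holds one tab-separated (head, relation, tail) triple; that layout is my assumption and is not confirmed in this thread. The helper names `read_entities` and `unseen_entities` are hypothetical and are not part of the MBE code.

```python
def read_entities(path, sep="\t"):
    """Collect head and tail entities from a triple file.

    Assumes one `head<sep>relation<sep>tail` triple per line (an assumption).
    """
    entities = set()
    with open(path, encoding="utf-8") as f:
        for line in f:
            parts = line.rstrip("\n").split(sep)
            if len(parts) != 3:
                continue  # skip blank or malformed lines
            head, _, tail = parts
            entities.update((head, tail))
    return entities


def unseen_entities(train_path, valid_path, sep="\t"):
    """Return the validation entities that never occur in the training file."""
    return read_entities(valid_path, sep) - read_entities(train_path, sep)
```

Calling `unseen_entities("base_train.triples", "valid.triples")` from inside a dataset directory should return an empty set if the subset constraint holds; any entity it returns is a counterexample like the ones reported above.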

yncui-nju commented 1 year ago

Thanks! These entities have been deleted.

chjjma commented 1 year ago

Thank you!