nju-websoft / MBE

Inductive Knowledge Graph Reasoning for Multi-batch Emerging Entities, CIKM 2022
GNU General Public License v3.0
15 stars 5 forks source link

About the dataset #5

Open wenwen19910224 opened 9 months ago

wenwen19910224 commented 9 months ago

Can you share your thoughts on the design of the dataset for this article?

yncui-nju commented 9 months ago

Hi, thank you for your interest in this work!

We constructed the dataset to primarily simulate three characteristics of real-world scenarios: (i) continuous growth in multiple batches; (ii) emerging entities are more likely to be few-shot, meaning they may only be present in a small number of facts; (iii) some emerging facts may have head and tail entities that are both unseen. As described in Section 4, the dataset is simulated to grow continuously, making popular entities more likely to be in the original KG, while entities with lower degrees are more likely to appear in new batches.