Retrieve common entities between wn18rr and FB15k237 datasets

raphaelsty / mkb

Knowledge Base Embedding By Cooperative Knowledge Distillation

65 stars 3 forks source link

Retrieve common entities between wn18rr and FB15k237 datasets #2

Closed raphaelsty closed 4 years ago

raphaelsty commented 4 years ago

I need to retrieve common entities between both files:

The best solution in my opinion is to add Part-of-speech tagging to the entities of the fb15k237 dataset.

I need to retrieve common relations between both files:

The relations of the datasets fb15k237 and wn18rr are quite distinct symbolically nevertheless there is a semantic link between the relations of the two datasets.

raphaelsty commented 4 years ago

There are 1552 common entities between WN18RR and FB15K237.

WN18RR shares 3.79% of its entities with FB15K237.

FB15K327 shares 10.88% of its entities with WN18RR.

raphaelsty commented 4 years ago

In FB15K237, there are 13411 training samples (e_i, r, e_j) where e_i and e_j are defined in WN18RR. It represent 4.92% of the training set of FB15K237.

4.69% for the valid set of FB15K237
4.59% for the test set of FB15K237

raphaelsty commented 4 years ago

I uploaded a new version of wn18rr and fb15k237.

I have aligned the entities of fb15k237 with the entities of wn18rr and I have concatenated the set of relations of both datasets.

New fb15k237

New wn18rr