malllabiisc / CompGCN

ICLR 2020: Composition-Based Multi-Relational Graph Convolutional Networks
Apache License 2.0
597 stars 107 forks source link

The whole graph of the adjacency matrix created with the training set cannot contain some entities that have not appeared in the training set. #22

Closed WeidongLi-KG closed 4 years ago

WeidongLi-KG commented 4 years ago

Therefore, how can you guarantee that the output here gets the embedding vectors of all entities (include the unseen entities in the test or valid set)? image

Next, your indexing operation may cause problems. image

These places have caused me a lot of confusion, so I ask for answers.

soumyasanyal commented 4 years ago

Hi WeidongLi-KG,

Thanks for your interest in our project. So, if I understand correctly, you mean that our model cannot be used inductively as is for new entities unseen during train time. That is a correct observation. And the reason we don't face issues is the KG link prediction experiments in our paper are in a transductive setting (where all entities are guaranteed to be seen during the train time). Having said that, it should be possible to use our method in an inductive setting as well. Say there is a new node for which we don't know the embedding. But usually, we will be given the edge connections of the new node. We can use the trained CompGCN to get the embedding of the node from its neighborhood. This is a standard procedure in inductive GNN settings. Hope this helps with both your queries!