snap-stanford / ogb

Benchmark datasets, data loaders, and evaluators for graph machine learning
https://ogb.stanford.edu
MIT License
1.89k stars, 398 forks

Is node2vec trained on edges that include the test edges used by the MLP? #112

Closed. 2023research closed this issue 3 years ago

2023research commented 3 years ago

In the link prediction examples such as ppa, node embeddings are first learned with node2vec and then used by an MLP to predict links. Is node2vec trained on all edges? If so, the embeddings would already encode the test edges that the MLP is later evaluated on, which would be a data leakage issue.

I have checked the test edges used by the MLP model, and they do not appear in the training data used by node2vec. Can you please confirm this?
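
For anyone who wants to repeat this kind of check, here is a minimal sketch. It is not code from the ogb repository: `canonical` and `leaked_edges` are hypothetical helper names, edges are assumed to be plain `(u, v)` pairs of integer node ids, and the graph is assumed to be undirected, so `(u, v)` and `(v, u)` count as the same edge. The toy edge lists stand in for the real data.

```python
from typing import Iterable, Set, Tuple

Edge = Tuple[int, int]


def canonical(edge: Edge) -> Edge:
    """Order the endpoints so (u, v) and (v, u) compare equal (undirected graph assumed)."""
    u, v = edge
    return (u, v) if u <= v else (v, u)


def leaked_edges(train_edges: Iterable[Edge], test_edges: Iterable[Edge]) -> Set[Edge]:
    """Return the test edges that also occur among the training edges."""
    train = {canonical(e) for e in train_edges}
    test = {canonical(e) for e in test_edges}
    return train & test


# Toy stand-ins: the first list plays the role of the edges node2vec is trained on,
# the second the role of the positive test edges scored by the MLP.
train_edges = [(0, 1), (1, 2), (2, 3)]
test_edges = [(3, 0), (1, 3)]

leaked = leaked_edges(train_edges, test_edges)
print(f"{len(leaked)} test edges also appear in the training edges")
```

An empty result means no test edge is visible to the embedding step. With the real dataset, the two lists would come from whatever edges are passed to node2vec and from the test split. How those are exposed (for example as tensors that need converting to pairs first) is an assumption to check against the ogb documentation.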