How to split the train and test sets?

tkipf / ica

Python implementation of the Iterative Classification Algorithm

MIT License


Open Gingersna opened 5 years ago

Gingersna commented 5 years ago

In these datasets, a node in the train set may be linked to a node in the test/val set, so I would like to know how to sample the splits to avoid data leakage.

tkipf commented 5 years ago

As far as I know, these train/val/test splits should be disjoint. Why do you think there is an overlap?
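For anyone who wants to check this on their own data, here is a minimal sketch that separates the two things being discussed: whether the split index sets share any node, and whether edges connect nodes in different splits. Node-disjoint splits can still be joined by edges, which may be what the question is describing. The function names, the toy edge list, and the index lists are all hypothetical and are not taken from this repository's code.

```python
def splits_are_disjoint(idx_train, idx_val, idx_test):
    """Return True if no node index appears in more than one split."""
    train, val, test = set(idx_train), set(idx_val), set(idx_test)
    return not (train & val or train & test or val & test)


def count_cross_split_edges(edges, idx_a, idx_b):
    """Count edges with one endpoint in idx_a and the other in idx_b."""
    a, b = set(idx_a), set(idx_b)
    return sum(1 for u, v in edges if (u in a and v in b) or (u in b and v in a))


# Toy example (assumed data): a path graph 0-1-2-3-4-5.
edges = [(0, 1), (1, 2), (2, 3), (3, 4), (4, 5)]
idx_train = [0, 1]
idx_val = [2, 3]
idx_test = [4, 5]

print(splits_are_disjoint(idx_train, idx_val, idx_test))    # True
print(count_cross_split_edges(edges, idx_train, idx_val))   # 1 (edge 1-2)
print(count_cross_split_edges(edges, idx_train, idx_test))  # 0
```

If the first check returns True but the edge counts are nonzero, the splits do not overlap as node sets, and the remaining question is only whether links between splits count as leakage for the intended evaluation.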