Questions about the GOOD-motif dataset

Hi Yangyang,

Thank you for your questions.

The id_val_dataset is the in-domain split in GOOD-Motif which shares similar distribution with the training set. In contrast, the val_dataset is the out-of-domain split in GOOD-Motif that consists of different distributions from the training set, i.e., different base graphs.
The train_dataset.data.env_id indicates the environment labels of samples in the dataset, which serves the same purpose as the environment partitions in Invariant Risk Minimization (IRM). For more information, please refer to our paper.

Please let me know if you have any further questions! :smile:

Best regards, Shurui Gui

divelab / GOOD