Closed tzemicheal closed 4 months ago
The issue is caused due to the non-determinist part introduced by dgl.dataloading.DataLoader
when used along neighbor dgl.dataloading.MultiLayerNeighborSampler
sampling instead of full sampling. There are two ways to fix this (which both of them fixed the issue)
dataloading.DataLoader
ref: https://discuss.dgl.ai/t/reproducibility-dataloader-shuffle-true-using-seeds/4275/2 .Change the neighbor sampling to dgl.dataloading.MultiLayerFullNeighborSampler
for the inference part. This has minimal effect on small graphs, for large graphs, it might be slower neighbor sampling. This fix uses this approach.
Ref:
/merge
Description
Fix the issues by updating the sampler during inference to full sampling from subsampling.
Closes #1676
By Submitting this PR I confirm: