Closed mfbalin closed 1 month ago
@frozenbugs
@Rhett-Ying
@mfbalin thanks for reporting this.
@az15240 could you help look into this? please try to reproduce in your local first.
@mfbalin which example exactly? please list the single-gpu one and multi-gpu one in the description.
Updated the description
@az15240 This could be the most possible culprit. please check whether the GraphBolt's BuiltinDataset processes in the same way as DGL.
@az15240 This could be the most possible culprit. please check whether the GraphBolt's BuiltinDataset processes in the same way as DGL.
This is very likely the cause. I'll update the dataset.
The dataset is updated. Running python examples/graphbolt/node_classification.py --dataset=ogbn-arxiv
will produce an accuracy close to 70%. Please follow up for any questions!
The dataset is updated. Running
python examples/graphbolt/node_classification.py --dataset=ogbn-arxiv
will produce an accuracy close to 70%. Please follow up for any questions!
Can you mention what changes you have made compared to the previous dataset? @az15240
@az15240 This could be the most possible culprit. please check whether the GraphBolt's BuiltinDataset processes in the same way as DGL.
I added bidirectional edges and self loops to the graphbolt dataset, as mentioned in this reply.
🐛 Bug
When we run our examples with the ogbn-arxiv BuiltinDataset, the accuracy numbers we get are far below the multigpu dgl example.
To Reproduce
Steps to reproduce the behavior:
DGL comparison from regression:
Test name: multi_gpu.bench_dgl_multigpu_node_classification.track_acc 'ogbn-products' | 'cpu-cuda' | '0' | 77.79 'ogbn-products' | 'cpu-cuda' | '0,1' | 77.02 'ogbn-products' | 'cpu-cuda' | '0,1,2,3' | 75.07 'ogbn-products' | 'cpu-cuda' | '0,1,2,3,4,5,6,7' | 73.19 'ogbn-arxiv' | 'cpu-cuda' | '0' | 70.11 'ogbn-arxiv' | 'cpu-cuda' | '0,1' | 69.23 'ogbn-arxiv' | 'cpu-cuda' | '0,1,2,3' | 69.05 'ogbn-arxiv' | 'cpu-cuda' | '0,1,2,3,4,5,6,7' | 67.19
Expected behavior
A test accuracy closer to 70% is expected while the current accuracy is below 55%.
Environment
conda
,pip
, source):Additional context