dmlc / dgl

Python package built to ease deep learning on graph, on top of existing DL frameworks.
http://dgl.ai
Apache License 2.0
13.57k stars 3.02k forks source link

[Graphbolt][Dataset] IGB-HOM dataset wrong number of edges #7780

Open BowenYao18 opened 2 months ago

BowenYao18 commented 2 months ago

🔨Work Item

IMPORTANT:

Project tracker: https://github.com/orgs/dmlc/projects/2

Description

The num edges in paper is 3995777033 in paper but the actual number of edges I download is 3727095830.

The shape should be (3727095830, 2), which does not match 3995777033 reported in paper. This is the link to the paper: https://arxiv.org/pdf/2302.13522

Below is the download command: wget https://igb-public-awsopen.s3.amazonaws.com/IGBH/processed/paper__cites__paper/edge_index.npy

This is the IGB-HOM info table: image

Depending work items or issues

mfbalin commented 2 months ago

@BowenYao18 is this still an issue?

BowenYao18 commented 2 months ago

@BowenYao18 is this still an issue?

https://github.com/IllinoisGraphBenchmark/IGB-Datasets/issues/55 I think they are fixing that now

I will try to see if they have updated now