Open czkkkkkk opened 9 months ago
@Skeleton003 could you look into it and try with the new implementation: https://github.com/dmlc/dgl/pull/6986
This issue has been automatically marked as stale due to lack of activity. It will be closed if no further activity occurs. Thank you
🚀 Feature
Motivation
Currently, preprocess_ondisk_dataset consumes much more memory than the topology of a graph itself during the preprocessing. When loading a graph with 2B nodes and 8B edges, it cannot be finished in a machine with 380 GB memory. After a rough profiling, I found that the peak memory usage is reached when converting a DGL graph to a fused sampling graph. https://github.com/dmlc/dgl/blob/4ee0a8bddbd93963b5f078c475381f4ab521d2e1/python/dgl/graphbolt/impl/ondisk_dataset.py#L212 There could be two factors contributing to the peak memory usage.
from_dglgraph
creates a temporary homogeneous graph and also its CSC format.Alternatives
Pitch
Additional context