Replace custom cuda hashtable with cuCollections'

dmlc / dgl

Python package built to ease deep learning on graph, on top of existing DL frameworks.

http://dgl.ai

Apache License 2.0

13.47k stars 3.01k forks source link

Replace custom cuda hashtable with cuCollections' #5498

Open nv-dlasalle opened 1 year ago

nv-dlasalle commented 1 year ago

🚀 Feature

Currently, the most expensive part of GPU based sampling is running to_block(), and specifically hash table insertions. The current implementation cuda_hashtable.cuh does not make good use of the hardware, and instead we should replace it the implementation in cuCollections.

See https://developer.nvidia.com/blog/maximizing-performance-with-massively-parallel-hash-maps-on-gpus/ for a more in-depth explanation.

github-actions[bot] commented 1 year ago

This issue has been automatically marked as stale due to lack of activity. It will be closed if no further activity occurs. Thank you