[QST]: Enable discussion tab on github repo? / nx-cugraph vs. cugraph benchmark

rapidsai / cugraph

cuGraph - RAPIDS Graph Analytics Library

Apache License 2.0

1.77k stars 304 forks source link

	k=10
NetworkX	2min 15s
NetworkX + nx-cugraph (cold)	17.6 s
NetworkX + nx-cugraph (warm)	13.1 s
cugraph	323 ms

k=10

NetworkX

2min 15s

NetworkX + nx-cugraph (cold)

17.6 s

NetworkX + nx-cugraph (warm)

13.1 s

cugraph

323 ms

Hi @raybellwaves, thanks for the questions, suggestions, positive comments, and giving nx-cugraph a try!

Regarding "cold" vs "warm", "warm" means something different here (i.e., in your notebook, "warm" means the libraries have been imported and GPU context has been created, which is a reasonable interpretation). @rlratzel ran these benchmarks (and made that slide), and your question is a "what if...?" he was worried about. I believe "warm" in the benchmarking slide means the graph has already been converted to nx-cugraph and resides on the GPU. This requires either passing an nx-cugraph Graph to networkx, or enabling caching of backend graph conversions and having a cached nx-cugraph graph. We expect caching to come in NetworkX 3.3 (PR is open and ready to go in), which should be released soon.

Without caching, sometimes graph conversion can take a non-trivial amount of time for large graphs. We made it as fast as we could within reason (for example, it's much faster than nx.to_scipy_sparse_array), but it still needs to handle a lot of pure Python objects. NetworkX 3.3 is also adding "should_run", which lets NetworkX to ask backends whether it should convert to them to run an algorithm. We don't use this yet, but we plan to soon.

I really like the profiling idea! I know that's pretty slick with cudf.pandas. I bet we can do something similar--preferably in a generic way to support other backends.

Finally, I don't have a strong opinion on GH issues vs GH discussions. For now, it's fine to ask questions via issues.

CC @rlratzel, do you want to add anything?

rapidsai / cugraph

[QST]: Enable discussion tab on github repo? / nx-cugraph vs. cugraph benchmark #4273

What is your question?

Code of Conduct