rapidsai / raft

RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing high performance applications.
https://docs.rapids.ai/api/raft/stable/
Apache License 2.0
683 stars 180 forks source link

ANN_BENCH: a global pool of result buffers across benchmark cases #2312

Closed achirkin closed 1 month ago

achirkin commented 1 month ago

Introduce a global pool of host/device buffers for result neighbors and distances to keep the allocations across benchmark cases. This slightly reduces the overheads of the benchmark setup, but, most importantly, reduces the number of cuda driver calls that may block the context. The latter is relevant for benchmarking persistent kernels - to let them run across multiple benchmark cases.

achirkin commented 1 month ago

/merge