NVIDIA / cccl

CUDA C++ Core Libraries

Gather benchmark results for each CUB algorithm using different offset types #1787

Open jrhemstad opened 1 month ago

jrhemstad commented 1 month ago

To make an informed decision about the offset type solution, we would like a complete understanding of its impact on each CUB algorithm. Therefore, for each algorithm, we'd like a table that summarizes the following:

elstehle commented 1 month ago

Priority-wise, I think we want to focus on the algorithms that do not yet support a large number of items (see overview). That is,