rapidsai / raft

RAFT contains fundamental, widely used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for writing high-performance applications more easily.
https://docs.rapids.ai/api/raft/stable/
Apache License 2.0
787 stars 195 forks

Upgrade to latest cutlass version #2503

vyasr closed 1 week ago

vyasr commented 1 week ago

This upgrade is important because cutlass 3.4 fixed kernel visibility issues that would otherwise cause raft kernels that use cutlass to be emitted as global kernel symbols.

vyasr commented 1 week ago

I can't approve my own PR (even though Kyle did all the work) so @KyleFromNVIDIA can approve on my behalf if needed :smile: The changes all look good.

KyleFromNVIDIA commented 1 week ago

/merge