LLNL / Aluminum

High-performance, GPU-aware communication library
https://aluminum.readthedocs.io/en/latest/
Other
84 stars 21 forks source link

Fix CUDA 12.2 build #219

Closed ndryden closed 10 months ago

ndryden commented 10 months ago

With CUDA 12.2, there are now additional default conversions for __half (and bfloat), so we need to provide an explicit specialization of operator<< for them, except for bfloat16 on ROCm platforms, where operator<< is provided.