Open felker opened 2 months ago
The kokkos/4.3.01_shared_PEg8.5.0_cv12.3_ct12.2.91
module build was built with cmake flag -DKokkos_ENABLE_IMPL_CUDA_MALLOC_ASYNC=OFF
The two kokkos/4.2.01
module builds were built with the default, -DKokkos_ENABLE_IMPL_CUDA_MALLOC_ASYNC=ON
. I believe those will have trouble if you enable GPU-aware MPICH.
Perhaps we should mention that in all 3x .lua
modulefiles, in addition to the user guide.
LAMMPS, AthenaK, XGC, and other Kokkos-based applications using versions starting with 4.2.00 in Nov 2023 are affected by an incompatibility with Cray MPICH (based on an older UCX) of the new default option:
A runtime error is thrown by CUDA-aware Cray MPICH if you try to use Kokkos with that option enabled:
I assume the 3x prebuilt Kokkos modules were all compiled with that option disabled @zippylab ?
There is a discussion to potentially revert the change to the default in 4.5.x https://github.com/kokkos/kokkos/pull/7353