Closed (lahwaacz closed this 1 month ago)
I could reproduce the build issue with math.cpp using your settings and the ROCm 6.2 image. I think I also have a fix, which I'm currently testing.
One comment on your CMake flags: with -DGINKGO_HAVE_GPU_AWARE_MPI=ON, Ginkgo will assume that it is linked against an MPI library that supports device memory. If that is not the case, MPI applications will just crash, without any indication as to why. So my suggestion would be to disable it and leave it to users to enable it explicitly, and only if they know that their MPI supports device memory.
On second thought, -DCMAKE_CXX_COMPILER=/opt/rocm/bin/hipcc might not be the intended way to compile Ginkgo, since hipcc treats all source files as HIP language source files. When I used -DCMAKE_CXX_COMPILER=/opt/rocm/lib/llvm/bin/amdclang++ instead, the build actually passed.
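For reference, here is a minimal sketch of a configure command that combines the two suggestions above (amdclang++ as the C++ compiler, GPU-aware MPI as an explicit choice). The helper name ginkgo_configure_cmd, the build directory name, and the GPU_AWARE_MPI environment variable are assumptions made for illustration; only the two -D flags and the compiler path come from this thread. The script only prints the command, so it can be inspected before anything is run.

```shell
#!/bin/sh
# Sketch only: assemble and print a configure command for Ginkgo.
# The compiler path and the GINKGO_HAVE_GPU_AWARE_MPI flag are taken from
# this thread; the helper name and the "build" directory are hypothetical.

ginkgo_configure_cmd() {
    # $1: ON or OFF for GINKGO_HAVE_GPU_AWARE_MPI.
    # Defaults to OFF, so device-memory support in MPI is never assumed.
    gpu_aware_mpi="${1:-OFF}"
    printf '%s\n' "cmake -S . -B build -DCMAKE_CXX_COMPILER=/opt/rocm/lib/llvm/bin/amdclang++ -DGINKGO_HAVE_GPU_AWARE_MPI=${gpu_aware_mpi}"
}

# Print the command. Set GPU_AWARE_MPI=ON in the environment only if the
# MPI library is known to support device memory.
ginkgo_configure_cmd "${GPU_AWARE_MPI:-OFF}"
```

A distribution that ships a GPU-aware MPI package could call ginkgo_configure_cmd ON; everyone else gets the safe default.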
I have no idea why using g++ as the CXX compiler does not work anymore, though :shrug:
As for MPI, in Arch Linux we specifically have a GPU-aware OpenMPI package and don't support switching to another MPI library.
Hi! I'm working on upgrading ginkgo-hpc for Arch Linux. So far I have the following build commands (I omitted the parts for the base and cuda packages):
I've backported https://github.com/ginkgo-project/ginkgo/pull/1670/commits/eb97b4969c66ca0fa9e91c339c7dc409cb6a9143 but still get this error, which does not seem to be fixed in math.hpp on develop:
Note that this is part of a ROCm 6.2.2 rebuild; we were not able to build ginkgo 1.8.0 with ROCm 6.0. I'm also not sure about the -DCMAKE_CXX_COMPILER=/opt/rocm/bin/hipcc flag, which was not needed before, but without it I get errors like this (maybe an ABI error when you try to link C++ code compiled by GCC with HIP code?):