Open MarcelKoch opened 1 year ago
Update: I got in contact with the support staff for the system I was using, and they were also unable to build this with the default rccl. They had to install their own rccl and use that, which worked. Still, I think it should be possible to build aws-ofi-rccl with the default rccl install, so I will not close the issue yet.
The
./configure
command does not automatically pick up the default rocm installation. On the systemROCM_PATH
is set toopt/rocm-5.3.0
, but the configure step doesn't pick this up, which I would expect from the configure help text. Instead I get the error:If I set
--with-hip=$ROCM_PATH/hip
it works, but then RCCL is not configured correctly using--with-rccl=$ROCM_PATH/rccl
. The configure step succeeds in that case, butmake
fails with