Open dalcinl opened 1 week ago
Is there anything useful in the output if you set UCX_LOG_LEVEL=info
? Unfortunately I'm unable to launch a ppc64le
container on my M1 Macbook to debug interactively.
No, UCX_LOG_LEVEL=info
produced no additional output. I'm building UCX with the configure-release
script, I'll try again with a debug build.
Once again, a debug build did not produce any additional output 😞 .
I'm having issues building MPICH 4.2.3 and 4.1.3 with external UCX 1.17.0 (+ fix from https://github.com/openucx/ucx/pull/9973) on ppc64le under emulation using podman. Builds on aarch64 and x86_64 are fine.
One of the build logs is here: https://github.com/mpi4py/mpi-publish/actions/runs/11803264738/job/32880846533. I can also reproduce the problem locally.
I'm configuring using
--with-device=ch4:ofi,ucx
. I run the basic MPI helloworld example settingMPICH_CH4_NETMOD=ucx
. I'm getting the following failure:IIRC, our attempts to build MPICH with UCX on conda-forge also faced runtime issues in ppc64le. Any tips on how to further debug this issue?