Open sleepwalker2017 opened 9 months ago
hostname is not an MPI program so it doesn't often show up errors with the MPI runtime. I often just download, compile and run a simple MPI program such as:
wget https://raw.githubusercontent.com/pmodels/mpich/main/examples/cpi.c
mpicc -o cpi cpi.c
Those errors look like they're coming from UCX. Maybe try setting UCX_TLS=tcp
and maybe set UCX_NET_DEVICES
to your Ethernet device
I run
mpirun --allow-run-as-root -np 4 -hostfile hostfile echo "hello"
, it runs ok.how could this happen? why would it connect to this ip: dest_addr=172.17.0.1:42887? I'm confused about this, any help?
Thank you!