Open langfield opened 4 years ago
Looks like you have a mismatched version on one of your nodes:
orted: Error: unknown option "--tree-spawn"
Only an old version of OMPI would not understand that cmd line option.
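One way to check for such a mismatch is to ask each node which Open MPI a non-interactive SSH session actually picks up, since that shell may have a different PATH than an interactive login. The following is only a minimal sketch: the function names are made up for illustration, it assumes `mpirun` is reachable through the PATH of the remote non-interactive shell, and it assumes the first line of `mpirun --version` identifies the release (for example a line such as "mpirun (Open MPI) 4.0.3").

```shell
#!/bin/sh
# Hypothetical helpers (not part of Open MPI) for comparing the Open MPI
# version that each host exposes to a non-interactive SSH session.

# Print "<host>: <first line of mpirun --version>" for every host given.
ompi_versions() {
    for host in "$@"; do
        # The remote command runs in a non-interactive shell, which is
        # similar to how the remote daemon gets launched over SSH.
        version=$(ssh "$host" 'mpirun --version 2>/dev/null | head -n 1')
        printf '%s: %s\n' "$host" \
            "${version:-no version reported (mpirun missing from PATH or SSH failed)}"
    done
}

# Succeed (exit status 0) only if all hosts report the same version line.
ompi_versions_match() {
    distinct=$(ompi_versions "$@" | cut -d: -f2- | sort -u | wc -l | tr -d ' ')
    [ "$distinct" -eq 1 ]
}
```

With the SSH aliases from the hostfile as arguments (the names `node1` and `node2` here are placeholders), `ompi_versions node1 node2` prints one line per host, and `ompi_versions_match node1 node2` can be used in an `if` to flag a mismatch.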
Thanks for replying. I've reinstalled on all nodes, and the "unknown option" error is gone, but the rest of the output is the same error (ORTE can't route from the head node to the target node).
Background information
What version of Open MPI are you using? (e.g., v3.0.5, v4.0.2, git branch name and hash, etc.)
v4.0.3
Describe how Open MPI was installed (e.g., from a source/distribution tarball, from a git clone, from an operating system distribution package, etc.)
From tarball.
If you are building/installing from a git clone, please copy-n-paste the output from git submodule status.
Please describe the system on which you are running
Details of the problem
Please describe, in detail, the problem that you are having, including the behavior you expect to see, the actual behavior that you are seeing, steps to reproduce the problem, etc. It is most helpful if you can attach a small program that a developer can use to reproduce your problem.
I'm attempting to test a 2-node cluster over a WAN. Passwordless SSH is set up between the two nodes with RSA keys, in both directions. Hostfile is as follows:
SSH access is on non-standard ports, so hostnames are aliased in .ssh/config, a bit like:
The error message below seems a bit cryptic. I'm not sure why manual SSH works between the nodes but OMPI is unable to connect. Any help appreciated.