open-mpi / ompi

Open MPI main development repository
https://www.open-mpi.org
Other
2.19k stars 864 forks source link

.>[Error: Name or service not known] #1510

Closed eediaz1987 closed 8 years ago

eediaz1987 commented 8 years ago

i have a error usin OpenMpi, here atach a log fragment

UOeediaz@hn:~$ mpirun -np 1 bin/mpi_hello
[hn:08542] [[INVALID],INVALID] ORTE_ERROR_LOG: Not found in file ../../../../../../orte/mca/ess/env/ess_env_module.c at line 358
[hn:08542] oob_tcp_parse_uri: Could not resolve 10.30.1.225,192.168.1.254. [Error: Name or service not known]
[hn:08542] [[INVALID],INVALID] ORTE_ERROR_LOG: A message is attempting to be sent to a process whose contact information is unknown in file ../../../../../../orte/mca/rml/oob/rml_oob_send.c at line 146
[hn:08542] [[INVALID],INVALID] attempted to send to [[56801,0],0]
[hn:08542] [[INVALID],INVALID] ORTE_ERROR_LOG: A message is attempting to be sent to a process whose contact information is unknown in file ../../../../../orte/mca/routed/base/routed_base_register_sync.c at line 92
[hn:08542] [[INVALID],INVALID] ORTE_ERROR_LOG: A message is attempting to be sent to a process whose contact information is unknown in file ../../../../../../orte/mca/routed/binomial/routed_binomial.c at line 891
[hn:08542] [[INVALID],INVALID] ORTE_ERROR_LOG: A message is attempting to be sent to a process whose contact information is unknown in file ../../../../../orte/mca/ess/base/ess_base_std_app.c at line 151

EDIT: Used github triple-single-tick to mark the above section as verbatim.

rhc54 commented 8 years ago

I'm afraid I need a little more info than that - at least what OMPI version you are using?

eediaz1987 commented 8 years ago

i'm using OMPI 1.8.4-gcc 4.9.2, i instaled using easybuild

jsquyres commented 8 years ago

FWIW, I've seen this kind of message when I inadvertently used Open MPI version X on one machine and Open MPI version Y on another.

I've also seen it when I compiled my MPI app with Open MPI version X and then used mpirun from Open MPI version Y.

Can you confirm that you are using a single, consistent version/installation of Open MPI?

ggouaillardet commented 8 years ago

@eediaz1987 this error can only occur if ipv6 is explicitly enabled at configure time with --enable-ipv6, and this is does not seem to be the case with easybuild https://github.com/hpcugent/easybuild-easyconfigs/blob/master/easybuild/easyconfigs/o/OpenMPI/OpenMPI-1.8.4-GCC-4.9.2.eb that being said, i am even surprise you ran into this at first glance, parse_uri should be given one host name, and not a comma separated list of hostnames as advised by @jsquyres, please make sure you always use the same openmpi version (at compile time, at run time and accross all your hosts)

as a side note, you can explicitly list the network used by oob/tcp for example mpirun --mca oob_tcp_if_include 10.30.1.0/24 ...

eediaz1987 commented 8 years ago

for this errror [hn:27581] oob_tcp_parse_uri: Could not resolve 10.30.1.225,192.168.1.254. [Error: Name or service not known] i run the command host 10.30.1.225 and the answer was host 10.30.1.225 225.1.30.10.in-addr.arpa domain name pointer hn.hpc.uo.edu.cu. and i obtenin the same answer for the other ip

2016-03-30 19:36 GMT-05:00 Gilles Gouaillardet notifications@github.com:

@eediaz1987 https://github.com/eediaz1987 this error can only occur if ipv6 is explicitly enabled at configure time with --enable-ipv6, and this is does not seem to be the case with easybuild https://github.com/hpcugent/easybuild-easyconfigs/blob/master/easybuild/easyconfigs/o/OpenMPI/OpenMPI-1.8.4-GCC-4.9.2.eb that being said, i am even surprise you ran into this at first glance, parse_uri should be given one host name, and not a comma separated list of hostnames as advised by @jsquyres https://github.com/jsquyres, please make sure you always use the same openmpi version (at compile time, at run time and accross all your hosts)

as a side note, you can explicitly list the network used by oob/tcp for example mpirun --mca oob_tcp_if_include 10.30.1.0/24 ...

— You are receiving this because you were mentioned. Reply to this email directly or view it on GitHub https://github.com/open-mpi/ompi/issues/1510#issuecomment-203696841

eediaz1987 commented 8 years ago

yes i'm using only OMPI 1.8.4-gcc 4.9.2

jsquyres commented 8 years ago

Can you send the full output from https://www.open-mpi.org/community/help/ ?

ggouaillardet commented 8 years ago

can you also run mpirun --mca oob_base_verbose 100 -np 1 bin/mpi_hello and post the output

eediaz1987 commented 8 years ago

i run that line and this is the output UOeediaz@hn:~$ mpirun --mca oob_base_verbose 100 -np 1 bin/mpi_hello [hn:09412] mca: base: components_register: registering oob components [hn:09412] mca: base: components_register: found loaded component tcp [hn:09412] mca: base: components_register: component tcp register function succe ssful [hn:09412] mca: base: components_open: opening oob components [hn:09412] mca: base: components_open: found loaded component tcp [hn:09412] mca: base: components_open: component tcp open function successful [hn:09412] mca:oob:select: checking available component tcp [hn:09412] mca:oob:select: Querying component [tcp] [hn:09412] oob:tcp: component_available called [hn:09412] WORKING INTERFACE 1 KERNEL INDEX 1 FAMILY: V4 [hn:09412] [[55417,0],0] oob:tcp:init rejecting loopback interface lo [hn:09412] WORKING INTERFACE 2 KERNEL INDEX 2 FAMILY: V4 [hn:09412] [[55417,0],0] oob:tcp:init adding 10.30.1.225 to our list of V4 conne ctions [hn:09412] WORKING INTERFACE 3 KERNEL INDEX 3 FAMILY: V4 [hn:09412] [[55417,0],0] oob:tcp:init adding 192.168.1.254 to our list of V4 con nections [hn:09412] [[55417,0],0] TCP STARTUP [hn:09412] [[55417,0],0] attempting to bind to IPv4 port 0 [hn:09412] [[55417,0],0] assigned IPv4 port 40524 [hn:09412] mca:oob:select: Adding component to end [hn:09412] mca:oob:select: Found 1 active transports [hn:09412] [[55417,0],0]: set_addr to uri 3631808512.0;tcp://10.30.1.225,192.168 .1.254:40524 [hn:09412] [[55417,0],0]:set_addr peer [[55417,0],0] is me [hn:09414] [[INVALID],INVALID] ORTE_ERROR_LOG: Not found in file ../../../../../ ../orte/mca/ess/env/ess_env_module.c at line 358 [hn:09414] mca: base: components_open: Looking for oob components [hn:09414] mca: base: components_open: opening oob components [hn:09414] mca: base: components_open: found loaded component tcp [hn:09414] mca: base: components_open: component tcp has no register function [hn:09414] mca: base: components_open: component tcp open function successful [hn:09414] oob_tcp_parse_uri: Could not resolve 10.30.1.225,192.168.1.254. [Error: Name or service not known] [hn:09414] [[INVALID],INVALID] ORTE_ERROR_LOG: A message is attempting to be sent to a process whose contact information is unknown in file ../../../../../../orte/mca/rml/oob/rml_oob_send.c at line 146 [hn:09414] [[INVALID],INVALID] attempted to send to [[55417,0],0] [hn:09414] [[INVALID],INVALID] ORTE_ERROR_LOG: A message is attempting to be sent to a process whose contact information is unknown in file ../../../../../orte/mca/routed/base/routed_base_register_sync.c at line 92 [hn:09414] [[INVALID],INVALID] ORTE_ERROR_LOG: A message is attempting to be sent to a process whose contact information is unknown in file ../../../../../../orte/mca/routed/binomial/routed_binomial.c at line 891 [hn:09414] [[INVALID],INVALID] ORTE_ERROR_LOG: A message is attempting to be sent to a process whose contact information is unknown in file ../../../../../orte/mca/ess/base/ess_base_std_app.c at line 151

2016-04-03 17:29 GMT-05:00 Gilles Gouaillardet notifications@github.com:

can you also run mpirun --mca oob_base_verbose 100 -np 1 bin/mpi_hello and post the output

— You are receiving this because you were mentioned. Reply to this email directly or view it on GitHub https://github.com/open-mpi/ompi/issues/1510#issuecomment-205068037

ggouaillardet commented 8 years ago

line 358 from ess_env_module.c does not seem to come from version 1.8.4 !

can you please double or triple check you are using the correct library

mpirun --version
ldd bin/mpi_hello

can you also post the output of

ompi_info --all

and the config.status easy build generated ?

ggouaillardet commented 8 years ago

@eediaz1987 I checked the sources, and as far as I am concerned, these traces can only make sense with mpirun from v1.8 series or later, whereas your app is using v1.6 series libraries

eediaz1987 commented 8 years ago

mpirun --version ldd bin/mpi_hello for this command the output is mpirun (Open MPI) 1.8.4 Report bugs to http://www.open-mpi.org/community/help/

and in the atachment is the output of the other command

2016-04-05 6:39 GMT-05:00 Gilles Gouaillardet notifications@github.com:

@eediaz1987 https://github.com/eediaz1987 I checked the sources, and as far as I am concerned, these traces can only make sense with mpirun from v1.8 series or later, whereas your app is using v1.6 series libraries

— You are receiving this because you were mentioned. Reply to this email directly or view it on GitHub https://github.com/open-mpi/ompi/issues/1510#issuecomment-205765766

             Package: Open MPI oau@hn Distribution
            Open MPI: 1.8.4

Open MPI repo revision: v1.8.3-330-g0344f04 Open MPI release date: Dec 19, 2014 Open RTE: 1.8.4 Open RTE repo revision: v1.8.3-330-g0344f04 Open RTE release date: Dec 19, 2014 OPAL: 1.8.4 OPAL repo revision: v1.8.3-330-g0344f04 OPAL release date: Dec 19, 2014 MPI API: 3.0 Ident string: 1.8.4 Prefix: /opt/librarieshpc/easybuild/software/OpenMPI/1.8.4-GCC-4.9.2 Exec_prefix: /opt/librarieshpc/easybuild/software/OpenMPI/1.8.4-GCC-4.9.2 Bindir: /opt/librarieshpc/easybuild/software/OpenMPI/1.8.4-GCC-4.9.2/bin Sbindir: /opt/librarieshpc/easybuild/software/OpenMPI/1.8.4-GCC-4.9.2/sbin Libdir: /opt/librarieshpc/easybuild/software/OpenMPI/1.8.4-GCC-4.9.2/lib Incdir: /opt/librarieshpc/easybuild/software/OpenMPI/1.8.4-GCC-4.9.2/include Mandir: /opt/librarieshpc/easybuild/software/OpenMPI/1.8.4-GCC-4.9.2/share/man Pkglibdir: /opt/librarieshpc/easybuild/software/OpenMPI/1.8.4-GCC-4.9.2/lib/openmpi Libexecdir: /opt/librarieshpc/easybuild/software/OpenMPI/1.8.4-GCC-4.9.2/libexec Datarootdir: /opt/librarieshpc/easybuild/software/OpenMPI/1.8.4-GCC-4.9.2/share Datadir: /opt/librarieshpc/easybuild/software/OpenMPI/1.8.4-GCC-4.9.2/share Sysconfdir: /opt/librarieshpc/easybuild/software/OpenMPI/1.8.4-GCC-4.9.2/etc Sharedstatedir: /opt/librarieshpc/easybuild/software/OpenMPI/1.8.4-GCC-4.9.2/com Localstatedir: /opt/librarieshpc/easybuild/software/OpenMPI/1.8.4-GCC-4.9.2/var Infodir: /opt/librarieshpc/easybuild/software/OpenMPI/1.8.4-GCC-4.9.2/share/info Pkgdatadir: /opt/librarieshpc/easybuild/software/OpenMPI/1.8.4-GCC-4.9.2/share/openmpi Pkglibdir: /opt/librarieshpc/easybuild/software/OpenMPI/1.8.4-GCC-4.9.2/lib/openmpi Pkgincludedir: /opt/librarieshpc/easybuild/software/OpenMPI/1.8.4-GCC-4.9.2/include/openmpi Configured architecture: x86_64-unknown-linux-gnu Configure host: hn Configured by: oau Configured on: Tue Mar 22 07:40:33 CDT 2016 Configure host: hn Built by: oau Built on: mar mar 22 07:50:12 CDT 2016 Built host: hn C bindings: yes C++ bindings: yes Fort mpif.h: yes (all) Fort use mpi: yes (full: ignore TKR) Fort use mpi size: deprecated-ompi-info-value Fort use mpi_f08: yes Fort mpi_f08 compliance: The mpi_f08 module is available, but due to limitations in the gfortran compiler, does not support the following: array subsections, direct passthru (where possible) to underlying Open MPI's C functionality Fort mpi_f08 subarrays: no Java bindings: no Wrapper compiler rpath: runpath C compiler: gcc C compiler absolute: /opt/librarieshpc/easybuild/software/GCC/4.9.2/bin/gcc C compiler family name: GNU C compiler version: 4.9.2 C char size: 1 C bool size: 1 C short size: 2 C int size: 4 C long size: 8 C float size: 4 C double size: 8 C pointer size: 8 C char align: 1 C bool align: 1 C int align: 4 C float align: 4 C double align: 8 C++ compiler: g++ C++ compiler absolute: /opt/librarieshpc/easybuild/software/GCC/4.9.2/bin/g++ Fort compiler: gfortran Fort compiler abs: /opt/librarieshpc/easybuild/software/GCC/4.9.2/bin/gfortran Fort ignore TKR: yes (!GCC$ ATTRIBUTES NO_ARG_CHECK ::) Fort 08 assumed shape: yes Fort optional args: yes Fort INTERFACE: yes Fort ISO_FORTRAN_ENV: yes Fort STORAGE_SIZE: yes Fort BIND(C) (all): yes Fort ISO_C_BINDING: yes Fort SUBROUTINE BIND(C): yes Fort TYPE,BIND(C): yes Fort T,BIND(C,name="a"): yes Fort PRIVATE: yes Fort PROTECTED: yes Fort ABSTRACT: yes Fort ASYNCHRONOUS: yes Fort PROCEDURE: yes Fort C_FUNLOC: yes Fort f08 using wrappers: yes Fort MPI_SIZEOF: yes Fort integer size: 4 Fort logical size: 4 Fort logical value true: 1 Fort have integer1: yes Fort have integer2: yes Fort have integer4: yes Fort have integer8: yes Fort have integer16: no Fort have real4: yes Fort have real8: yes Fort have real16: yes Fort have complex8: yes Fort have complex16: yes Fort have complex32: yes Fort integer1 size: 1 Fort integer2 size: 2 Fort integer4 size: 4 Fort integer8 size: 8 Fort integer16 size: -1 Fort real size: 4 Fort real4 size: 4 Fort real8 size: 8 Fort real16 size: 16 Fort dbl prec size: 8 Fort cplx size: 8 Fort dbl cplx size: 16 Fort cplx8 size: 8 Fort cplx16 size: 16 Fort cplx32 size: 32 Fort integer align: 4 Fort integer1 align: 1 Fort integer2 align: 2 Fort integer4 align: 4 Fort integer8 align: 8 Fort integer16 align: -1 Fort real align: 4 Fort real4 align: 4 Fort real8 align: 8 Fort real16 align: 16 Fort dbl prec align: 8 Fort cplx align: 4 Fort dbl cplx align: 8 Fort cplx8 align: 4 Fort cplx16 align: 8 Fort cplx32 align: 16 C profiling: yes C++ profiling: yes Fort mpif.h profiling: yes Fort use mpi profiling: yes Fort use mpi_f08 prof: yes C++ exceptions: no Thread support: posix (MPI_THREAD_MULTIPLE: yes, OPAL support: yes, OMPI progress: no, ORTE progress: yes, Event lib: yes) Sparse Groups: no Build CFLAGS: -DNDEBUG -O2 -march=native -finline-functions -fno-strict-aliasing Build CXXFLAGS: -DNDEBUG -O2 -march=native -finline-functions Build FCFLAGS: -O2 -march=native Build LDFLAGS: -L/opt/librarieshpc/easybuild/software/GCC/4.9.2/lib64 -L/opt/librarieshpc/easybuild/software/GCC/4.9.2/lib -L/opt/librarieshpc/easybuild/software/hwloc/1.10.0-GCC-4.9.2/lib -L/opt/librarieshpc/easybuild/software/hwloc/1.10.0-GCC-4.9.2/lib Build LIBS: -lrt -lutil -lm -lpthread -lhwloc Wrapper extra CFLAGS: Wrapper extra CXXFLAGS: Wrapper extra FCFLAGS: Wrapper extra LDFLAGS: -L/usr/local/lib -Wl,--rpath -Wl,/usr/local/lib -Wl,--rpath -Wl,/usr/local/lib -Wl,--rpath -Wl,/usr/local/lib -Wl,-rpath -Wl,/usr/local/lib -Wl,-rpath -Wl,@{libdir} -Wl,--enable-new-dtags Wrapper extra LIBS: -ldl -lrt -ltorque -libverbs -lutil -lm -lpthread -lhwloc Internal debug support: no MPI interface warnings: yes MPI parameter check: runtime Memory profiling support: no Memory debugging support: no libltdl support: no Heterogeneous support: no mpirun default --prefix: yes MPI I/O support: yes MPI_WTIME support: gettimeofday Symbol vis. support: yes Host topology support: yes MPI extensions: FT Checkpoint support: no (checkpoint thread: no) C/R Enabled Debugging: no VampirTrace support: yes MPI_MAX_PROCESSOR_NAME: 256 MPI_MAX_ERROR_STRING: 256 MPI_MAX_OBJECT_NAME: 64 MPI_MAX_INFO_KEY: 36 MPI_MAX_INFO_VAL: 256 MPI_MAX_PORT_NAME: 1024 MPI_MAX_DATAREP_STRING: 128 MCA mca: parameter "mca_param_files" (current value: "/shared/home/UOeediaz/.openmpi/mca-params.conf:/opt/librarieshpc/easybuild/software/OpenMPI/1.8.4-GCC-4.9.2/etc/openmpi-mca-params.conf", data source: default, level: 2 user/detail, type: string, deprecated, synonym of: mca_base_param_files) Path for MCA configuration files containing variable values MCA mca: parameter "mca_component_path" (current value: "/opt/librarieshpc/easybuild/software/OpenMPI/1.8.4-GCC-4.9.2/lib/openmpi:/shared/home/UOeediaz/.openmpi/components", data source: default, level: 9 dev/all, type: string, deprecated, synonym of: mca_base_component_path) Path where to look for Open MPI and ORTE components MCA mca: parameter "mca_component_show_load_errors" (current value: "true", data source: default, level: 9 dev/all, type: bool, deprecated, synonym of: mca_base_component_show_load_errors) Whether to show errors for components that failed to load or not Valid values: 0: f|false|disabled, 1: t|true|enabled MCA mca: parameter "mca_component_disable_dlopen" (current value: "false", data source: default, level: 9 dev/all, type: bool, deprecated, synonym of: mca_base_component_disable_dlopen) Whether to attempt to disable opening dynamic components or not Valid values: 0: f|false|disabled, 1: t|true|enabled MCA mca: parameter "mca_verbose" (current value: "stderr", data source: default, level: 9 dev/all, type: string, deprecated, synonym of: mca_base_verbose) Specifies where the default error output stream goes (this is separate from distinct help messages). Accepts a comma-delimited list of: stderr, stdout, syslog, syslogpri:<notice|info|debug>, syslogid: (where str is the prefix string for all syslog notices), file[:filename](if filename is not specified, a default filename is used), fileappend (if not specified, the file is opened for truncation), level[:N](if specified, integer verbose level; otherwise, 0 is implied) MCA mca: parameter "mca_base_param_files" (current value: "/shared/home/UOeediaz/.openmpi/mca-params.conf:/opt/librarieshpc/easybuild/software/OpenMPI/1.8.4-GCC-4.9.2/etc/openmpi-mca-params.conf", data source: default, level: 2 user/detail, type: string, synonyms: mca_param_files) Path for MCA configuration files containing variable values MCA mca: informational "mca_base_override_param_file" (current value: "/opt/librarieshpc/easybuild/software/OpenMPI/1.8.4-GCC-4.9.2/etc/openmpi-mca-params-override.conf", data source: default, level: 2 user/detail, type: string) Variables set in this file will override any value set inthe environment or another configuration file MCA mca: parameter "mca_base_suppress_override_warning" (current value: "false", data source: default, level: 2 user/detail, type: bool) Suppress warnings when attempting to set an overridden value (default: false) Valid values: 0: f|false|disabled, 1: t|true|enabled MCA mca: parameter "mca_base_param_file_prefix" (current value: "", data source: default, level: 3 user/all, type: string) Aggregate MCA parameter file sets MCA mca: parameter "mca_base_param_file_path" (current value: "/opt/librarieshpc/easybuild/software/OpenMPI/1.8.4-GCC-4.9.2/share/openmpi/amca-param-sets:/shared/home/UOeediaz", data source: default, level: 3 user/all, type: string) Aggregate MCA parameter Search path MCA mca: parameter "mca_base_param_file_path_force" (current value: "", data source: default, level: 3 user/all, type: string) Forced Aggregate MCA parameter Search path MCA mca: parameter "mca_base_component_path" (current value: "/opt/librarieshpc/easybuild/software/OpenMPI/1.8.4-GCC-4.9.2/lib/openmpi:/shared/home/UOeediaz/.openmpi/components", data source: default, level: 9 dev/all, type: string, synonyms: mca_component_path) Path where to look for Open MPI and ORTE components MCA mca: parameter "mca_base_component_show_load_errors" (current value: "true", data source: default, level: 9 dev/all, type: bool, synonyms: mca_component_show_load_errors) Whether to show errors for components that failed to load or not Valid values: 0: f|false|disabled, 1: t|true|enabled MCA mca: parameter "mca_base_component_disable_dlopen" (current value: "false", data source: default, level: 9 dev/all, type: bool, synonyms: mca_component_disable_dlopen) Whether to attempt to disable opening dynamic components or not Valid values: 0: f|false|disabled, 1: t|true|enabled MCA mca: parameter "mca_base_verbose" (current value: "stderr", data source: default, level: 9 dev/all, type: string, synonyms: mca_verbose) Specifies where the default error output stream goes (this is separate from distinct help messages). Accepts a comma-delimited list of: stderr, stdout, syslog, syslogpri:<notice|info|debug>, syslogid: (where str is the prefix string for all syslog notices), file[:filename](if filename is not specified, a default filename is used), fileappend (if not specified, the file is opened for truncation), level[:N](if specified, integer verbose level; otherwise, 0 is implied) MCA opal: parameter "opal_signal" (current value: "6,7,8,11", data source: default, level: 3 user/all, type: string) Comma-delimited list of integer signal numbers to Open MPI to attempt to intercept. Upon receipt of the intercepted signal, Open MPI will display a stack trace and abort. Open MPI will not replace signals if handlers are already installed by the time MPI_INIT is invoked. Optionally append ":complain" to any signal number in the comma-delimited list to make Open MPI complain if it detects another signal handler (and therefore does not insert its own). MCA opal: parameter "opal_set_max_sys_limits" (current value: "", data source: default, level: 3 user/all, type: string) Set the specified system-imposed limits to the specified value, including "unlimited".Supported params: core, filesize, maxmem, openfiles, stacksize, maxchildren MCA opal: parameter "opal_net_private_ipv4" (current value: "10.0.0.0/8;172.16.0.0/12;192.168.0.0/16;169.254.0.0/16", data source: default, level: 3 user/all, type: string) Semicolon-delimited list of CIDR notation entries specifying what networks are considered "private" (default value based on RFC1918 and RFC3330) MCA opal: parameter "opal_event_include" (current value: "poll", data source: default, level: 3 user/all, type: string, synonym of: event_libevent2021_event_include) Comma-delimited list of libevent subsystems to use (epoll, poll, select -- available on your platform) MCA backtrace: parameter "backtrace" (current value: "", data source: default, level: 2 user/detail, type: string) Default selection set of components for the backtrace framework ( means use all components that can be found) MCA backtrace: parameter "backtrace_base_verbose" (current value: "0", data source: default, level: 8 dev/detail, type: int) Verbosity level for the backtrace framework (0 = no verbosity) MCA compress: parameter "compress" (current value: "", data source: default, level: 2 user/detail, type: string) Default selection set of components for the compress framework ( means use all components that can be found) MCA compress: parameter "compress_base_verbose" (current value: "0", data source: default, level: 8 dev/detail, type: int) Verbosity level for the compress framework (0 = no verbosity) MCA compress: parameter "compress_gzip_priority" (current value: "15", data source: default, level: 9 dev/all, type: int) Priority of the COMPRESS gzip component (default: 15) MCA compress: parameter "compress_gzip_verbose" (current value: "0", data source: default, level: 9 dev/all, type: int) Verbose level for the COMPRESS gzip component MCA compress: parameter "compress_bzip_priority" (current value: "10", data source: default, level: 9 dev/all, type: int) Priority of the COMPRESS bzip component (default: 10) MCA compress: parameter "compress_bzip_verbose" (current value: "0", data source: default, level: 9 dev/all, type: int) Verbose level for the COMPRESS bzip component MCA crs: parameter "crs" (current value: "", data source: default, level: 2 user/detail, type: string) Default selection set of components for the crs framework ( means use all components that can be found) MCA crs: parameter "crs_base_verbose" (current value: "0", data source: default, level: 8 dev/detail, type: int) Verbosity level for the crs framework (0 = no verbosity) MCA crs: informational "crs_none_priority" (current value: "1", data source: default, level: 3 user/all, type: int) Priority of the crs none component MCA crs: parameter "crs_none_select_warning" (current value: "false", data source: default, level: 9 dev/all, type: bool) Enable warning when the 'none' component is selected when checkpoint/restart functionality is requested.[Default = disabled/no-warning] Valid values: 0: f|false|disabled, 1: t|true|enabled MCA db: parameter "db" (current value: "", data source: default, level: 2 user/detail, type: string) Default selection set of components for the db framework ( means use all components that can be found) MCA db: parameter "db_base_verbose" (current value: "0", data source: default, level: 8 dev/detail, type: int) Verbosity level for the db framework (0 = no verbosity) MCA db: parameter "db_print_file" (current value: "", data source: default, level: 9 dev/all, type: string) Print to the indicated file (- => stdout, + => stderr) MCA db: parameter "db_hash_store_priority" (current value: "1", data source: default, level: 9 dev/all, type: int) Priority dictating order in which store commands will given to database components MCA db: parameter "db_hash_fetch_priority" (current value: "100", data source: default, level: 9 dev/all, type: int) Priority dictating order in which fetch commands will given to database components MCA event: parameter "event" (current value: "", data source: default, level: 2 user/detail, type: string) Default selection set of components for the event framework ( means use all components that can be found) MCA event: parameter "event_base_verbose" (current value: "0", data source: default, level: 8 dev/detail, type: int) Verbosity level for the event framework (0 = no verbosity) MCA event: parameter "event_libevent2021_event_include" (current value: "poll", data source: default, level: 3 user/all, type: string, synonyms: opal_event_include) Comma-delimited list of libevent subsystems to use (epoll, poll, select -- available on your platform) MCA hwloc: parameter "hwloc" (current value: "", data source: default, level: 2 user/detail, type: string) Default selection set of components for the hwloc framework ( means use all components that can be found) MCA hwloc: parameter "hwloc_base_verbose" (current value: "0", data source: default, level: 8 dev/detail, type: int) Verbosity level for the hwloc framework (0 = no verbosity) MCA hwloc: parameter "hwloc_base_mem_alloc_policy" (current value: "none", data source: default, level: 9 dev/all, type: int) General memory allocations placement policy (this is not memory binding). "none" means that no memory policy is applied. "local_only" means that a process' memory allocations will be restricted to its local NUMA node. If using direct launch, this policy will not be in effect until after MPI_INIT. Note that operating system paging policies are unaffected by this setting. For example, if "local_only" is used and local NUMA node memory is exhausted, a new memory allocation may cause paging. Valid values: 0:"none", 1:"local_only" MCA hwloc: parameter "hwloc_base_mem_bind_failure_action" (current value: "warn", data source: default, level: 9 dev/all, type: int) What Open MPI will do if it explicitly tries to bind memory to a specific NUMA location, and fails. Note that this is a different case than the general allocation policy described by hwloc_base_alloc_policy. A value of "silent" means that Open MPI will proceed without comment. A value of "warn" means that Open MPI will warn the first time this happens, but allow the job to continue (possibly with degraded performance). A value of "error" means that Open MPI will abort the job if this happens. Valid values: 0:"silent", 1:"warn", 2:"error" MCA hwloc: parameter "hwloc_base_binding_policy" (current value: "", data source: default, level: 9 dev/all, type: string) Policy for binding processes. Allowed values: none, hwthread, core, l1cache, l2cache, l3cache, socket, numa, board ("none" is the default when oversubscribed, "core" is the default when np<=2, and "socket" is the default when np>2). Allowed qualifiers: overload-allowed, if-supported MCA hwloc: parameter "hwloc_base_bind_to_core" (current value: "false", data source: default, level: 9 dev/all, type: bool) Bind processes to cores Valid values: 0: f|false|disabled, 1: t|true|enabled MCA hwloc: parameter "hwloc_base_bind_to_socket" (current value: "false", data source: default, level: 9 dev/all, type: bool) Bind processes to sockets Valid values: 0: f|false|disabled, 1: t|true|enabled MCA hwloc: parameter "hwloc_base_report_bindings" (current value: "false", data source: default, level: 9 dev/all, type: bool) Report bindings to stderr Valid values: 0: f|false|disabled, 1: t|true|enabled MCA hwloc: parameter "hwloc_base_slot_list" (current value: "", data source: default, level: 9 dev/all, type: string) List of processor IDs to bind processes to [default=NULL] MCA hwloc: parameter "hwloc_base_cpu_set" (current value: "", data source: default, level: 9 dev/all, type: string) Comma-separated list of ranges specifying logical cpus allocated to this job [default: none] MCA hwloc: parameter "hwloc_base_use_hwthreads_as_cpus" (current value: "false", data source: default, level: 9 dev/all, type: bool) Use hardware threads as independent cpus Valid values: 0: f|false|disabled, 1: t|true|enabled MCA hwloc: parameter "hwloc_base_topo_file" (current value: "", data source: default, level: 9 dev/all, type: string) Read local topology from file instead of directly sensing it MCA if: parameter "if" (current value: "", data source: default, level: 2 user/detail, type: string) Default selection set of components for the if framework ( means use all components that can be found) MCA if: parameter "if_base_verbose" (current value: "0", data source: default, level: 8 dev/detail, type: int) Verbosity level for the if framework (0 = no verbosity) MCA if: parameter "if_base_do_not_resolve" (current value: "false", data source: default, level: 9 dev/all, type: bool) If nonzero, do not attempt to resolve interfaces Valid values: 0: f|false|disabled, 1: t|true|enabled MCA if: parameter "if_base_retain_loopback" (current value: "false", data source: default, level: 9 dev/all, type: bool) If nonzero, retain loopback interfaces Valid values: 0: f|false|disabled, 1: t|true|enabled MCA memchecker: parameter "memchecker" (current value: "", data source: default, level: 2 user/detail, type: string) Default selection set of components for the memchecker framework ( means use all components that can be found) MCA memchecker: parameter "memchecker_base_verbose" (current value: "0", data source: default, level: 8 dev/detail, type: int) Verbosity level for the memchecker framework (0 = no verbosity) MCA memcpy: parameter "memcpy" (current value: "", data source: default, level: 2 user/detail, type: string) Default selection set of components for the memcpy framework ( means use all components that can be found) MCA memcpy: parameter "memcpy_base_verbose" (current value: "0", data source: default, level: 8 dev/detail, type: int) Verbosity level for the memcpy framework (0 = no verbosity) MCA memory: parameter "memory" (current value: "", data source: default, level: 2 user/detail, type: string) Default selection set of components for the memory framework ( means use all components that can be found) MCA memory: parameter "memory_base_verbose" (current value: "0", data source: default, level: 8 dev/detail, type: int) Verbosity level for the memory framework (0 = no verbosity) MCA memory: parameter "memory_linux_ptmalloc2_available" (current value: "true", data source: default, level: 3 user/all, type: bool) Whether ptmalloc2 support is included in Open MPI or not (1 = yes, 0 = no) Valid values: 0: f|false|disabled, 1: t|true|enabled MCA memory: parameter "memory_linux_ummunotify_available" (current value: "false", data source: default, level: 3 user/all, type: bool) Whether ummunotify support is included in Open MPI or not (1 = yes, 0 = no) Valid values: 0: f|false|disabled, 1: t|true|enabled MCA memory: parameter "memory_linux_ptmalloc2_enable" (current value: "-1", data source: default, level: 3 user/all, type: int) Whether to enable ptmalloc2 support or not (negative = try to enable, but continue even if support is not available, 0 = do not enable support, positive = try to enable and fail if support is not available) MCA memory: parameter "memory_linux_ummunotify_enable" (current value: "-1", data source: default, level: 3 user/all, type: int) Whether to enable ummunotify support or not (negative = try to enable, but continue even if support is not available, 0 = do not enable support, positive = try to enable and fail if support is not available) MCA memory: parameter "memory_linux_disable" (current value: "false", data source: default, level: 3 user/all, type: bool) If this MCA parameter is set to 1 VIA ENVIRONMENT VARIABLE ONLY* (this MCA parameter CANNOT be set in a file or on the mpirun command line!), this component will be disabled and will not attempt to use either ummunotify or memory hook support Valid values: 0: f|false|disabled, 1: t|true|enabled MCA pstat: parameter "pstat" (current value: "", data source: default, level: 2 user/detail, type: string) Default selection set of components for the pstat framework ( means use all components that can be found) MCA pstat: parameter "pstat_base_verbose" (current value: "0", data source: default, level: 8 dev/detail, type: int) Verbosity level for the pstat framework (0 = no verbosity) MCA sec: parameter "sec" (current value: "", data source: default, level: 2 user/detail, type: string) Default selection set of components for the sec framework ( means use all components that can be found) MCA sec: parameter "sec_base_verbose" (current value: "0", data source: default, level: 8 dev/detail, type: int) Verbosity level for the sec framework (0 = no verbosity) MCA shmem: parameter "shmem" (current value: "", data source: default, level: 2 user/detail, type: string) Default selection set of components for the shmem framework ( means use all components that can be found) MCA shmem: parameter "shmem_base_verbose" (current value: "0", data source: default, level: 8 dev/detail, type: int) Verbosity level for the shmem framework (0 = no verbosity) MCA shmem: parameter "shmem_mmap_priority" (current value: "50", data source: default, level: 3 user/all, type: int) Priority for shmem mmap component (default: 50) MCA shmem: parameter "shmem_mmap_enable_nfs_warning" (current value: "true", data source: default, level: 9 dev/all, type: bool) Enable the warning emitted when Open MPI detects that its shared memory backing file is located on a network filesystem (1 = enabled, 0 = disabled). Valid values: 0: f|false|disabled, 1: t|true|enabled MCA shmem: parameter "shmem_mmap_relocate_backing_file" (current value: "0", data source: default, level: 9 dev/all, type: int) Whether to change the default placement of backing files or not (Negative = try to relocate backing files to an area rooted at the path specified by shmem_mmap_backing_file_base_dir, but continue with the default path if the relocation fails, 0 = do not relocate, Positive = same as the negative option, but will fail if the relocation fails. MCA shmem: parameter "shmem_mmap_backing_file_base_dir" (current value: "/dev/shm", data source: default, level: 9 dev/all, type: string) Specifies where backing files will be created when shmem_mmap_relocate_backing_file is in use. MCA shmem: parameter "shmem_posix_priority" (current value: "40", data source: default, level: 3 user/all, type: int) Priority for the shmem posix component (default: 40) MCA shmem: parameter "shmem_sysv_priority" (current value: "30", data source: default, level: 3 user/all, type: int) Priority for the shmem sysv component (default: 30) MCA timer: parameter "timer" (current value: "", data source: default, level: 2 user/detail, type: string) Default selection set of components for the timer framework ( means use all components that can be found) MCA timer: parameter "timer_base_verbose" (current value: "0", data source: default, level: 8 dev/detail, type: int) Verbosity level for the timer framework (0 = no verbosity) MCA orte: parameter "orte_tmpdir_base" (current value: "", data source: default, level: 9 dev/all, type: string) Base of the session directory tree to be used by all processes MCA orte: parameter "orte_local_tmpdir_base" (current value: "", data source: default, level: 9 dev/all, type: string) Base of the session directory tree to be used by orterun/mpirun MCA orte: parameter "orte_remote_tmpdir_base" (current value: "", data source: default, level: 9 dev/all, type: string) Base of the session directory tree on remote nodes, if required to be different from head node MCA orte: parameter "orte_no_session_dirs" (current value: "", data source: default, level: 9 dev/all, type: string) Prohibited locations for session directories (multiple locations separated by ',', default=NULL) MCA orte: parameter "orte_create_session_dirs" (current value: "true", data source: default, level: 9 dev/all, type: bool) Create session directories Valid values: 0: f|false|disabled, 1: t|true|enabled MCA orte: parameter "orte_execute_quiet" (current value: "false", data source: default, level: 9 dev/all, type: bool) Do not output error and help messages Valid values: 0: f|false|disabled, 1: t|true|enabled MCA orte: parameter "orte_report_silent_errors" (current value: "false", data source: default, level: 9 dev/all, type: bool) Report all errors, including silent ones Valid values: 0: f|false|disabled, 1: t|true|enabled MCA orte: parameter "orte_debug" (current value: "false", data source: default, level: 9 dev/all, type: bool) Top-level ORTE debug switch (default: false) Valid values: 0: f|false|disabled, 1: t|true|enabled MCA orte: parameter "orte_debug_verbose" (current value: "-1", data source: default, level: 9 dev/all, type: int) Verbosity level for ORTE debug messages (default: 1) MCA orte: parameter "orte_debug_daemons_file" (current value: "false", data source: default, level: 9 dev/all, type: bool) Whether want stdout/stderr of daemons to go to a file or not Valid values: 0: f|false|disabled, 1: t|true|enabled MCA orte: parameter "orte_debug_daemons" (current value: "false", data source: default, level: 9 dev/all, type: bool) Whether to debug the ORTE daemons or not Valid values: 0: f|false|disabled, 1: t|true|enabled MCA orte: parameter "orte_progress_thread_debug" (current value: "-1", data source: default, level: 9 dev/all, type: int) Debug level for ORTE progress threads MCA orte: parameter "orte_leave_session_attached" (current value: "false", data source: default, level: 9 dev/all, type: bool) Whether applications and/or daemons should leave their sessions attached so that any output can be received - this allows X forwarding without all the attendant debugging output Valid values: 0: f|false|disabled, 1: t|true|enabled MCA orte: parameter "orte_output_debugger_proctable" (current value: "false", data source: default, level: 9 dev/all, type: bool) Whether or not to output the debugger proctable after launch (default: false) Valid values: 0: f|false|disabled, 1: t|true|enabled MCA orte: parameter "orte_debugger_test_daemon" (current value: "", data source: default, level: 9 dev/all, type: string) Name of the executable to be used to simulate a debugger colaunch (relative or absolute path) MCA orte: parameter "orte_debugger_test_attach" (current value: "false", data source: default, level: 9 dev/all, type: bool) Test debugger colaunch after debugger attachment Valid values: 0: f|false|disabled, 1: t|true|enabled MCA orte: parameter "orte_debugger_check_rate" (current value: "0", data source: default, level: 9 dev/all, type: int) Set rate (in secs) for auto-detect of debugger attachment (0 => do not check) MCA orte: parameter "orte_do_not_launch" (current value: "false", data source: default, level: 9 dev/all, type: bool) Perform all necessary operations to prepare to launch the application, but do not actually launch it Valid values: 0: f|false|disabled, 1: t|true|enabled MCA orte: parameter "orte_daemon_spin" (current value: "false", data source: default, level: 9 dev/all, type: bool) Have any orteds spin until we can connect a debugger to them Valid values: 0: f|false|disabled, 1: t|true|enabled MCA orte: parameter "orte_daemon_fail" (current value: "-1", data source: default, level: 9 dev/all, type: int) Have the specified orted fail after init for debugging purposes MCA orte: parameter "orte_daemon_fail_delay" (current value: "0", data source: default, level: 9 dev/all, type: int) Have the specified orted fail after specified number of seconds (default: 0 => no delay) MCA orte: parameter "orte_startup_timeout" (current value: "0", data source: default, level: 9 dev/all, type: int) Seconds to wait for startup or job launch before declaring failed_to_start (default: 0 => do not check) MCA orte: parameter "orte_timing_details" (current value: "false", data source: default, level: 9 dev/all, type: bool) Request that detailed timing data by reported Valid values: 0: f|false|disabled, 1: t|true|enabled MCA orte: parameter "orte_timing" (current value: "false", data source: default, level: 9 dev/all, type: bool) Request that critical timing loops be measured Valid values: 0: f|false|disabled, 1: t|true|enabled MCA orte: parameter "orte_base_user_debugger" (current value: "totalview @mpirun@ -a @mpirun_args@ : ddt -n @np@ -start @executable@ @executable_argv@ @single_app@ : fxp @mpirun@ -a @mpirun_args@", data source: default, level: 9 dev/all, type: string) Sequence of user-level debuggers to search for in orterun MCA orte: parameter "orte_default_hostfile" (current value: "/opt/librarieshpc/easybuild/software/OpenMPI/1.8.4-GCC-4.9.2/etc/openmpi-default-hostfile", data source: default, level: 9 dev/all, type: string) Name of the default hostfile (relative or absolute path, "none" to ignore environmental or default MCA param setting) MCA orte: parameter "orte_node_regex" (current value: "", data source: default, level: 9 dev/all, type: string) Regular expression defining nodes in the system MCA orte: parameter "orte_keep_fqdn_hostnames" (current value: "false", data source: default, level: 9 dev/all, type: bool) Whether or not to keep FQDN hostnames [default: no] Valid values: 0: f|false|disabled, 1: t|true|enabled MCA orte: parameter "orte_retain_aliases" (current value: "false", data source: default, level: 9 dev/all, type: bool) Whether or not to keep aliases for host names [default: no] Valid values: 0: f|false|disabled, 1: t|true|enabled MCA orte: parameter "orte_hostname_alias_index" (current value: "1", data source: default, level: 9 dev/all, type: int) If hostname aliases are being retained, which one to use for the debugger proc table [default: 1st alias] MCA orte: parameter "orte_xml_output" (current value: "false", data source: default, level: 9 dev/all, type: bool) Display all output in XML format (default: false) Valid values: 0: f|false|disabled, 1: t|true|enabled MCA orte: parameter "orte_tag_output" (current value: "false", data source: default, level: 9 dev/all, type: bool) Tag all output with [job,rank](default: false) Valid values: 0: f|false|disabled, 1: t|true|enabled MCA orte: parameter "orte_xml_file" (current value: "", data source: default, level: 9 dev/all, type: string) Provide all output in XML format to the specified file MCA orte: parameter "orte_timestamp_output" (current value: "false", data source: default, level: 9 dev/all, type: bool) Timestamp all application process output (default: false) Valid values: 0: f|false|disabled, 1: t|true|enabled MCA orte: parameter "orte_output_filename" (current value: "", data source: default, level: 9 dev/all, type: string) Redirect output from application processes into filename.rank [default: NULL] MCA orte: parameter "orte_show_resolved_nodenames" (current value: "false", data source: default, level: 9 dev/all, type: bool) Display any node names that are resolved to a different name (default: false) Valid values: 0: f|false|disabled, 1: t|true|enabled MCA orte: parameter "orte_hetero_nodes" (current value: "false", data source: default, level: 9 dev/all, type: bool) Nodes in cluster may differ in topology, so send the topology back from each node [Default = false] Valid values: 0: f|false|disabled, 1: t|true|enabled MCA orte: parameter "orte_launch_agent" (current value: "orted", data source: default, level: 9 dev/all, type: string) Command used to start processes on remote nodes (default: orted) MCA orte: parameter "orte_fork_agent" (current value: "", data source: default, level: 9 dev/all, type: string) Command used to fork processes on remote nodes (default: NULL) MCA orte: parameter "orte_allocation_required" (current value: "false", data source: default, level: 9 dev/all, type: bool) Whether or not an allocation by a resource manager is required [default: no] Valid values: 0: f|false|disabled, 1: t|true|enabled MCA orte: parameter "orte_map_stddiag_to_stderr" (current value: "false", data source: default, level: 9 dev/all, type: bool) Map output from opal_output to stderr of the local process [default: no] Valid values: 0: f|false|disabled, 1: t|true|enabled MCA orte: parameter "orte_xterm" (current value: "", data source: default, level: 9 dev/all, type: string) Create a new xterm window and display output from the specified ranks there [default: none] MCA orte: parameter "orte_forward_job_control" (current value: "false", data source: default, level: 9 dev/all, type: bool) Forward SIGTSTP (after converting to SIGSTOP) and SIGCONT signals to the application procs [default: no] Valid values: 0: f|false|disabled, 1: t|true|enabled MCA orte: parameter "orte_report_launch_progress" (current value: "false", data source: default, level: 9 dev/all, type: bool) Output a brief periodic report on launch progress [default: no] Valid values: 0: f|false|disabled, 1: t|true|enabled MCA orte: parameter "orte_report_events" (current value: "", data source: default, level: 9 dev/all, type: string) URI to which events are to be reported (default: NULL) MCA orte: parameter "orte_enable_recovery" (current value: "false", data source: default, level: 9 dev/all, type: bool) Enable recovery from process failure [Default = disabled] Valid values: 0: f|false|disabled, 1: t|true|enabled MCA orte: parameter "orte_max_restarts" (current value: "0", data source: default, level: 9 dev/all, type: int) Max number of times to restart a failed process MCA orte: parameter "orte_abort_on_non_zero_status" (current value: "true", data source: default, level: 9 dev/all, type: bool) Abort the job if any process returns a non-zero exit status - no restart in such cases Valid values: 0: f|false|disabled, 1: t|true|enabled MCA orte: parameter "orte_allowed_exit_without_sync" (current value: "false", data source: default, level: 9 dev/all, type: bool) Process exiting without calling finalize will not trigger job termination Valid values: 0: f|false|disabled, 1: t|true|enabled MCA orte: parameter "orte_staged_execution" (current value: "false", data source: default, level: 9 dev/all, type: bool) Staged execution is being used Valid values: 0: f|false|disabled, 1: t|true|enabled MCA orte: parameter "orte_report_child_jobs_separately" (current value: "false", data source: default, level: 9 dev/all, type: bool) Return the exit status of the primary job only Valid values: 0: f|false|disabled, 1: t|true|enabled MCA orte: parameter "orte_stat_history_size" (current value: "1", data source: default, level: 9 dev/all, type: int) Number of stat samples to keep MCA orte: parameter "orte_forward_envars" (current value: "", data source: default, level: 9 dev/all, type: string) Comma-delimited environmental variables to forward, can include value to set MCA orte: parameter "orte_max_vm_size" (current value: "-1", data source: default, level: 9 dev/all, type: int) Maximum size of virtual machine - used to subdivide allocation MCA orte: parameter "orte_set_default_slots" (current value: "cores", data source: default, level: 9 dev/all, type: string) Set the number of slots on nodes that lack such info to the number of specified objects [a number, "cores" (default), "numas", "sockets", "hwthreads" (default if hwthreads_as_cpus is set), or "none" to skip this option] MCA orte: parameter "orte_display_alloc" (current value: "false", data source: default, level: 9 dev/all, type: bool, synonyms: ras_base_display_alloc) Whether to display the allocation after it is determined Valid values: 0: f|false|disabled, 1: t|true|enabled MCA orte: parameter "orte_display_devel_alloc" (current value: "false", data source: default, level: 9 dev/all, type: bool, synonyms: ras_base_display_devel_alloc) Whether to display a developer-detail allocation after it is determined Valid values: 0: f|false|disabled, 1: t|true|enabled MCA orte: parameter "orte_soft_locations" (current value: "false", data source: default, level: 9 dev/all, type: bool) Treat -host directives as desired, but not required Valid values: 0: f|false|disabled, 1: t|true|enabled MCA orte: parameter "orte_daemon_cores" (current value: "", data source: default, level: 5 tuner/detail, type: string) Restrict the ORTE daemons (including mpirun) to operate on the specified cores (comma-separated list of ranges) MCA orte: parameter "orte_rsh_agent" (current value: "ssh : rsh", data source: default, level: 9 dev/all, type: string, deprecated, synonym of: plm_rsh_agent) The command used to launch executables on remote nodes (typically either "ssh" or "rsh") MCA orte: parameter "orte_assume_same_shell" (current value: "true", data source: default, level: 9 dev/all, type: bool, synonym of: plm_rsh_assume_same_shell) If set to true, assume that the shell on the remote node is the same as the shell on the local node. Otherwise, probe for what the remote shell [default: 1] Valid values: 0: f|false|disabled, 1: t|true|enabled MCA orte: parameter "orte_rankfile" (current value: "", data source: default, level: 5 tuner/detail, type: string, synonym of: rmaps_rank_file_path) Name of the rankfile to be used for mapping processes (relative or absolute path) MCA orte: parameter "orte_base_help_aggregate" (current value: "true", data source: default, level: 9 dev/all, type: bool) If orte_base_help_aggregate is true, duplicate help messages will be aggregated rather than displayed individually. This can be helpful for parallel jobs that experience multiple identical failures; rather than print out the same help/failure message N times, display it once with a count of how many processes sent the same message. Valid values: 0: f|false|disabled, 1: t|true|enabled MCA orte: parameter "orte_ess_jobid" (current value: "", data source: default, level: 9 dev/all, type: string, synonym of: ess_base_jobid) Process jobid MCA orte: parameter "orte_ess_vpid" (current value: "", data source: default, level: 9 dev/all, type: string, synonym of: ess_base_vpid) Process vpid MCA orte: parameter "orte_ess_num_procs" (current value: "-1", data source: default, level: 9 dev/all, type: int, synonym of: ess_base_num_procs) Used to discover the number of procs in the job MCA dfs: parameter "dfs" (current value: "", data source: default, level: 2 user/detail, type: string) Default selection set of components for the dfs framework ( means use all components that can be found) MCA dfs: parameter "dfs_base_verbose" (current value: "0", data source: default, level: 8 dev/detail, type: int) Verbosity level for the dfs framework (0 = no verbosity) MCA dfs: parameter "dfs_test_select" (current value: "false", data source: default, level: 9 dev/all, type: bool) Apps select the test plug-in for the DFS framework Valid values: 0: f|false|disabled, 1: t|true|enabled MCA dfs: parameter "dfs_orted_num_worker_threads" (current value: "0", data source: default, level: 9 dev/all, type: int) Number of worker threads to use for processing file requests MCA errmgr: parameter "errmgr" (current value: "", data source: default, level: 2 user/detail, type: string) Default selection set of components for the errmgr framework ( means use all components that can be found) MCA errmgr: parameter "errmgr_base_verbose" (current value: "0", data source: default, level: 8 dev/detail, type: int) Verbosity level for the errmgr framework (0 = no verbosity) MCA errmgr: parameter "errmgr_default_tool_priority" (current value: "1000", data source: default, level: 9 dev/all, type: int) Priority of the default_tool errmgr component MCA errmgr: parameter "errmgr_default_hnp_priority" (current value: "1000", data source: default, level: 9 dev/all, type: int) Priority of the default_hnp errmgr component MCA errmgr: parameter "errmgr_default_orted_priority" (current value: "1000", data source: default, level: 9 dev/all, type: int) Priority of the default_orted errmgr component MCA errmgr: parameter "errmgr_default_app_priority" (current value: "1000", data source: default, level: 9 dev/all, type: int) Priority of the default_app errmgr component MCA ess: parameter "ess" (current value: "", data source: default, level: 2 user/detail, type: string) Default selection set of components for the ess framework ( means use all components that can be found) MCA ess: parameter "ess_base_verbose" (current value: "0", data source: default, level: 8 dev/detail, type: int) Verbosity level for the ess framework (0 = no verbosity) MCA ess: parameter "ess_base_stream_buffering" (current value: "default", data source: default, level: 9 dev/all, type: int) Adjust buffering for stdout/stderr [-1 system default] [0 unbuffered] [1 line buffered] [2 fully buffered](Default: -1) Valid values: -1:"default", 0:"unbuffered", 1:"line_buffered", 2:"fully_buffered" MCA ess: parameter "ess_singleton_server" (current value: "", data source: default, level: 9 dev/all, type: string, synonyms: server) Server to be used as HNP - [file|FILE]: or just uri MCA filem: parameter "filem" (current value: "", data source: default, level: 2 user/detail, type: string) Default selection set of components for the filem framework ( means use all components that can be found) MCA filem: parameter "filem_base_verbose" (current value: "0", data source: default, level: 8 dev/detail, type: int) Verbosity level for the filem framework (0 = no verbosity) MCA filem: parameter "filem_raw_flatten_directory_trees" (current value: "false", data source: default, level: 9 dev/all, type: bool) Put all files in the working directory instead of creating their respective directory trees Valid values: 0: f|false|disabled, 1: t|true|enabled MCA grpcomm: parameter "grpcomm" (current value: "", data source: default, level: 2 user/detail, type: string) Default selection set of components for the grpcomm framework ( means use all components that can be found) MCA grpcomm: parameter "grpcomm_base_verbose" (current value: "0", data source: default, level: 8 dev/detail, type: int) Verbosity level for the grpcomm framework (0 = no verbosity) MCA iof: parameter "iof" (current value: "", data source: default, level: 2 user/detail, type: string) Default selection set of components for the iof framework ( means use all components that can be found) MCA iof: parameter "iof_base_verbose" (current value: "0", data source: default, level: 8 dev/detail, type: int) Verbosity level for the iof framework (0 = no verbosity) MCA iof: parameter "iof_base_output_limit" (current value: "2147483647", data source: default, level: 9 dev/all, type: size_t) Maximum backlog of output messages [default: unlimited] MCA iof: parameter "iof_base_input_files" (current value: "", data source: default, level: 9 dev/all, type: string) Comma-separated list of input files to be read and sent to stdin of procs (default: NULL) MCA odls: parameter "odls" (current value: "", data source: default, level: 2 user/detail, type: string) Default selection set of components for the odls framework ( means use all components that can be found) MCA odls: parameter "odls_base_verbose" (current value: "0", data source: default, level: 8 dev/detail, type: int) Verbosity level for the odls framework (0 = no verbosity) MCA odls: parameter "odls_base_sigkill_timeout" (current value: "1", data source: default, level: 9 dev/all, type: int) Time to wait for a process to die after issuing a kill signal to it MCA oob: parameter "oob" (current value: "", data source: default, level: 2 user/detail, type: string) Default selection set of components for the oob framework ( means use all components that can be found) MCA oob: parameter "oob_base_verbose" (current value: "0", data source: default, level: 8 dev/detail, type: int) Verbosity level for the oob framework (0 = no verbosity) MCA oob: parameter "oob_base_enable_module_progress_threads" (current value: "false", data source: default, level: 9 dev/all, type: bool) Whether to independently progress OOB messages for each interface Valid values: 0: f|false|disabled, 1: t|true|enabled MCA oob: parameter "oob_tcp_peer_limit" (current value: "-1", data source: default, level: 9 dev/all, type: int) Maximum number of peer connections to simultaneously maintain (-1 = infinite) MCA oob: parameter "oob_tcp_peer_retries" (current value: "2", data source: default, level: 9 dev/all, type: int) Number of times to try shutting down a connection before giving up MCA oob: parameter "oob_tcp_sndbuf" (current value: "131072", data source: default, level: 9 dev/all, type: int) TCP socket send buffering size (in bytes) MCA oob: parameter "oob_tcp_rcvbuf" (current value: "131072", data source: default, level: 9 dev/all, type: int) TCP socket receive buffering size (in bytes) MCA oob: parameter "oob_tcp_if_include" (current value: "", data source: default, level: 9 dev/all, type: string, synonyms: oob_tcp_include) Comma-delimited list of devices and/or CIDR notation of TCP networks to use for Open MPI bootstrap communication (e.g., "eth0,192.168.0.0/16"). Mutually exclusive with oob_tcp_if_exclude. MCA oob: parameter "oob_tcp_if_exclude" (current value: "", data source: default, level: 9 dev/all, type: string, synonyms: oob_tcp_exclude) Comma-delimited list of devices and/or CIDR notation of TCP networks to NOT use for Open MPI bootstrap communication -- all devices not matching these specifications will be used (e.g., "eth0,192.168.0.0/16"). If set to a non-default value, it is mutually exclusive with oob_tcp_if_include. MCA oob: parameter "oob_tcp_static_ipv4_ports" (current value: "", data source: default, level: 9 dev/all, type: string) Static ports for daemons and procs (IPv4) MCA oob: parameter "oob_tcp_dynamic_ipv4_ports" (current value: "", data source: default, level: 9 dev/all, type: string) Range of ports to be dynamically used by daemons and procs (IPv4) MCA oob: parameter "oob_tcp_disable_ipv4_family" (current value: "false", data source: default, level: 9 dev/all, type: bool) Disable the IPv4 interfaces Valid values: 0: f|false|disabled, 1: t|true|enabled MCA plm: parameter "plm" (current value: "", data source: default, level: 2 user/detail, type: string) Default selection set of components for the plm framework ( means use all components that can be found) MCA plm: parameter "plm_base_verbose" (current value: "0", data source: default, level: 8 dev/detail, type: int) Verbosity level for the plm framework (0 = no verbosity) MCA plm: parameter "plm_slurm_args" (current value: "", data source: default, level: 9 dev/all, type: string) Custom arguments to srun MCA plm: parameter "plm_rsh_num_concurrent" (current value: "128", data source: default, level: 9 dev/all, type: int) How many plm_rsh_agent instances to invoke concurrently (must be > 0) MCA plm: parameter "plm_rsh_force_rsh" (current value: "false", data source: default, level: 9 dev/all, type: bool) Force the launcher to always use rsh Valid values: 0: f|false|disabled, 1: t|true|enabled MCA plm: parameter "plm_rsh_disable_qrsh" (current value: "false", data source: default, level: 9 dev/all, type: bool) Disable the launcher to use qrsh when under the Grid Engine parallel environment Valid values: 0: f|false|disabled, 1: t|true|enabled MCA plm: parameter "plm_rsh_daemonize_qrsh" (current value: "false", data source: default, level: 9 dev/all, type: bool) Daemonize the orted under the Grid Engine parallel environment Valid values: 0: f|false|disabled, 1: t|true|enabled MCA plm: parameter "plm_rsh_disable_llspawn" (current value: "false", data source: default, level: 9 dev/all, type: bool) Disable the use of llspawn when under the LoadLeveler environment Valid values: 0: f|false|disabled, 1: t|true|enabled MCA plm: parameter "plm_rsh_daemonize_llspawn" (current value: "false", data source: default, level: 9 dev/all, type: bool) Daemonize the orted when under the LoadLeveler environment Valid values: 0: f|false|disabled, 1: t|true|enabled MCA plm: parameter "plm_rsh_priority" (current value: "10", data source: default, level: 9 dev/all, type: int) Priority of the rsh plm component MCA plm: parameter "plm_rsh_delay" (current value: "", data source: default, level: 9 dev/all, type: string) Delay between invocations of the remote agent (sec[:usec]) MCA plm: parameter "plm_rsh_no_tree_spawn" (current value: "false", data source: default, level: 9 dev/all, type: bool) If set to true, do not launch via a tree-based topology Valid values: 0: f|false|disabled, 1: t|true|enabled MCA plm: parameter "plm_rsh_agent" (current value: "ssh : rsh", data source: default, level: 9 dev/all, type: string, synonyms: pls_rsh_agent, orte_rsh_agent) The command used to launch executables on remote nodes (typically either "ssh" or "rsh") MCA plm: parameter "plm_rsh_assume_same_shell" (current value: "true", data source: default, level: 9 dev/all, type: bool, synonyms: orte_assume_same_shell) If set to true, assume that the shell on the remote node is the same as the shell on the local node. Otherwise, probe for what the remote shell [default: 1] Valid values: 0: f|false|disabled, 1: t|true|enabled MCA plm: parameter "plm_rsh_pass_environ_mca_params" (current value: "true", data source: default, level: 9 dev/all, type: bool) If set to false, do not include mca params from the environment on the orted cmd line Valid values: 0: f|false|disabled, 1: t|true|enabled MCA plm: parameter "plm_tm_want_path_check" (current value: "true", data source: default, level: 9 dev/all, type: bool) Whether the launching process should check for the plm_tm_o

ggouaillardet commented 8 years ago

@eediaz1987 could you please run

ldd bin/mpi_hello

and post the output ?

you can also try

`which mpirun` -np 1bin/mpi_hello

and see if you are luckier with that

eediaz1987 commented 8 years ago

this is the output for the first command

UOeediaz@hn:~$ ldd bin/mpi_hello linux-vdso.so.1 => (0x00007ffcd4b0c000) libmpi.so.0 => /usr/lib/libmpi.so.0 (0x00007fdd2f7fc000) libopen-rte.so.0 => /usr/lib/libopen-rte.so.0 (0x00007fdd2f5ae000) libopen-pal.so.0 => /usr/lib/libopen-pal.so.0 (0x00007fdd2f357000) libdl.so.2 => /lib/x86_64-linux-gnu/libdl.so.2 (0x00007fdd2f153000) libnsl.so.1 => /lib/x86_64-linux-gnu/libnsl.so.1 (0x00007fdd2ef3b000) libutil.so.1 => /lib/x86_64-linux-gnu/libutil.so.1 (0x00007fdd2ed38000) libm.so.6 => /lib/x86_64-linux-gnu/libm.so.6 (0x00007fdd2eab6000) libpthread.so.0 => /lib/x86_64-linux-gnu/libpthread.so.0 (0x00007fdd2e89a000) libc.so.6 => /lib/x86_64-linux-gnu/libc.so.6 (0x00007fdd2e50f000) /lib64/ld-linux-x86-64.so.2 (0x00007fdd2faae000)

and this is the output for the second UOeediaz@hn:~$ which mpirun -np 1 bin/mpi_hello /opt/librarieshpc/easybuild/software/OpenMPI/1.8.4-GCC-4.9.2/bin/mpirun bin/mpi_hello

2016-04-07 9:35 GMT-05:00 Gilles Gouaillardet notifications@github.com:

@eediaz1987 https://github.com/eediaz1987 could you please run

ldd bin/mpi_hello

and post the output ?

you can also try

which mpirun -np 1bin/mpi_hello

and see if you are luckier with that

— You are receiving this because you were mentioned. Reply to this email directly or view it on GitHub https://github.com/open-mpi/ompi/issues/1510#issuecomment-206934150

rhc54 commented 8 years ago

@eediaz1987 If you look at the output, you can clearly see the problem - you built mpi_hello against a library in /usr, and mpirun is in /opt

You need to fix your path and rebuild mpi_hello

eediaz1987 commented 8 years ago

thanks, i will the mpi_hello and i will try again

2016-04-08 13:32 GMT-05:00 rhc54 notifications@github.com:

@eediaz1987 https://github.com/eediaz1987 If you look at the output, you can clearly see the problem - you built mpi_hello against a library in /usr, and mpirun is in /opt

You need to fix your path and rebuild mpi_hello

— You are receiving this because you were mentioned. Reply to this email directly or view it on GitHub https://github.com/open-mpi/ompi/issues/1510#issuecomment-207548929

jsquyres commented 8 years ago

@eediaz1987 I'm going to close this issue, since it looks like the root cause has been found. Feel free to reply / re-open the issue if the issue still isn't solved.