Closed: aetx closed this issue 5 years ago
Hmmm, interesting. It does seem that we need to be a little more rigorous in determining whether the detected features actually work.
For the time being, a workaround is to pass -DCAF_ENABLE_FAILED_IMAGES=FALSE when configuring with CMake.
The solution to this will be to use introspection to compile and run a test program using the MPIX features. Fun times. This is not a high priority but should be easy to do.
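A rough sketch of what such a compile-and-run check could look like, using CMake's CheckCSourceRuns module. This is an assumption-laden illustration, not the check OpenCoarrays ships: HAVE_WORKING_ULFM is a hypothetical variable name, the probe assumes the MPIX_ prototypes are visible through mpi.h (some implementations put them in a separate extension header), and it assumes the MPI program can be started directly as a singleton without mpiexec. It also assumes an unimplemented call reports an error code once MPI_ERRORS_RETURN is set; if the library aborts instead, the probe still exits with a non-zero status and the check fails, which is the desired outcome.

```cmake
# Sketch only: test that the ULFM calls work at run time,
# rather than merely checking that their symbols link.
find_package(MPI REQUIRED)
include(CheckCSourceRuns)

set(CMAKE_REQUIRED_INCLUDES  ${MPI_C_INCLUDE_PATH})
set(CMAKE_REQUIRED_LIBRARIES ${MPI_C_LIBRARIES})

# HAVE_WORKING_ULFM is a hypothetical result variable.
check_c_source_runs("
#include <mpi.h>
int main(int argc, char **argv) {
  MPI_Group failed;
  int rc;
  MPI_Init(&argc, &argv);
  /* Ask for error codes instead of an immediate abort. */
  MPI_Comm_set_errhandler(MPI_COMM_WORLD, MPI_ERRORS_RETURN);
  rc = MPIX_Comm_failure_ack(MPI_COMM_WORLD);
  if (rc == MPI_SUCCESS)
    rc = MPIX_Comm_failure_get_acked(MPI_COMM_WORLD, &failed);
  if (rc == MPI_SUCCESS && failed != MPI_GROUP_EMPTY)
    MPI_Group_free(&failed);
  MPI_Finalize();
  return rc == MPI_SUCCESS ? 0 : 1;
}
" HAVE_WORKING_ULFM)

unset(CMAKE_REQUIRED_INCLUDES)
unset(CMAKE_REQUIRED_LIBRARIES)
```

The result could then gate the failed-images option, for example by only honoring CAF_ENABLE_FAILED_IMAGES when HAVE_WORKING_ULFM is true. A run-time probe like this will not work when cross-compiling, so a symbol-only fallback would probably still be needed there.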
The experimental/proposed ULFM features did not make it into MPI 4; furthermore, it was determined that a major rework is required. As such, we have disabled ULFM support by default, even when the build system detects it. While this does, in principle, still need fixing, I'm going to close this as "won't fix" until the ULFM implementation stabilizes a bit and we have something reliable to test against.
Defect/Bug Report
uname -a: Linux jrl10 3.10.0-693.17.1.el7.x86_64 #1 SMP Thu Jan 25 20:13:58 UTC 2018 x86_64 x86_64 x86_64 GNU/Linux
OpenCoarrays compiled using
Observed Behavior
When running cmake, it seems to find MPIX_Comm_failure_get_acked.
However, the function is not implemented in ParaStationMPI 5.2.0, and running the code produces the following error:
When compiling with -DCAF_ENABLE_FAILED_IMAGES=FALSE, the functions are still found, but the code runs without aborting.
Expected Behavior
CMake should correctly identify whether the functions can actually be used.
This may be a bigger problem in this case, though, because CMake only checks whether the symbols appear in the MPI library, and they do seem to be there.
Steps to Reproduce
Compile OpenCoarrays with failed image support, then run a program using coarrays.