bbopt / nomad

NOMAD - A blackbox optimization software
https://nomad-4-user-guide.readthedocs.io/
GNU Lesser General Public License v3.0

Transient failure in ExampleAdvancedPSDMads test #72

Open paulapatience opened 2 years ago

paulapatience commented 2 years ago

After adding set -o pipefail right after the shebang line of the runExampleTest.sh script (to detect failing library tests; see https://github.com/bbopt/nomad4dev/issues/707), the ExampleAdvancedPSDMads test fails, but only intermittently. Here is the relevant section of the log from my latest (failed) build of NOMAD through Guix:

starting phase `check'
Running tests...
/gnu/store/zga679c4nldah9l8dhd5a4hdy820hcyf-cmake-minimal-3.21.4/bin/ctest --force-new-ctest-process 
Test project /tmp/guix-build-nomad-optimizer-4.2.0.drv-0/build
      Start  1: Example1BasicLib
 1/21 Test  #1: Example1BasicLib ........................   Passed   23.50 sec
      Start  2: Example2BasicLib
 2/21 Test  #2: Example2BasicLib ........................   Passed    5.67 sec
      Start  3: Example3BasicLib
 3/21 Test  #3: Example3BasicLib ........................   Passed    1.77 sec
      Start  4: SingleObjParallelBasicLib
 4/21 Test  #4: SingleObjParallelBasicLib ...............   Passed    0.66 sec
      Start  5: ExampleAdvancedFixedVariables
 5/21 Test  #5: ExampleAdvancedFixedVariables ...........   Passed   50.30 sec
      Start  6: ExampleAdvancedNMOnly
 6/21 Test  #6: ExampleAdvancedNMOnly ...................   Passed    0.68 sec
      Start  7: ExampleAdvancedRestart
 7/21 Test  #7: ExampleAdvancedRestart ..................   Passed    6.48 sec
      Start  8: ExampleAdvancedSuggestAndObserve
 8/21 Test  #8: ExampleAdvancedSuggestAndObserve ........   Passed    0.09 sec
      Start  9: ExampleAdvancedLoopSuggestAndObserve
 9/21 Test  #9: ExampleAdvancedLoopSuggestAndObserve ....   Passed    0.45 sec
      Start 10: ExampleAdvancedPSDMads
10/21 Test #10: ExampleAdvancedPSDMads ..................***Failed    0.22 sec
Running Example test /tmp/guix-build-nomad-optimizer-4.2.0.drv-0/build/examples/advanced/library/PSDMads/rosenbrock.exe
Warning: Dimension 50 is greater than (or equal to) 50. Models are disabled.
BBE OBJ
1 162.5     
48 156.29    
60 153.49    
64 150.2     
67 151.48    
79 149.85    
87 137.4     
terminate called after throwing an instance of 'NOMAD_4_2::Exception'
  what():  NOMAD::Exception thrown (/tmp/guix-build-nomad-optimizer-4.2.0.drv-0/source/src/Eval/Barrier.cpp, 144) Barrier has points of size 2 and of size 50
/tmp/guix-build-nomad-optimizer-4.2.0.drv-0/build/examples/runExampleTest.sh: line 11:  1989 Aborted                 $executable $param_file
      1990 Done                    | tee runTestLog.txt

      Start 11: ExampleAdvancedC-API1
11/21 Test #11: ExampleAdvancedC-API1 ...................   Passed   34.83 sec
      Start 12: ExampleAdvancedC-API2
12/21 Test #12: ExampleAdvancedC-API2 ...................   Passed  229.05 sec
      Start 13: Example1BasicBatch
13/21 Test #13: Example1BasicBatch ......................   Passed    6.88 sec
      Start 14: Example2BasicBatch
14/21 Test #14: Example2BasicBatch ......................   Passed    9.38 sec
      Start 15: Example3BasicBatch
15/21 Test #15: Example3BasicBatch ......................   Passed    0.26 sec
      Start 16: ExampleSingleObjBasicBatch
16/21 Test #16: ExampleSingleObjBasicBatch ..............   Passed    6.53 sec
      Start 17: ExampleSurrogateSortBasicBatch
17/21 Test #17: ExampleSurrogateSortBasicBatch ..........   Passed    0.94 sec
      Start 18: ExampleBasicBatchSingleObjParallel
18/21 Test #18: ExampleBasicBatchSingleObjParallel ......   Passed    0.01 sec
      Start 19: ExampleAdvancedBatchLHOnly
19/21 Test #19: ExampleAdvancedBatchLHOnly ..............   Passed    0.12 sec
      Start 20: ExampleAdvancedBatchFixedVariables
20/21 Test #20: ExampleAdvancedBatchFixedVariables ......   Passed   60.65 sec
      Start 21: ExampleAdvancedBatchSuggestAndObserve
21/21 Test #21: ExampleAdvancedBatchSuggestAndObserve ...   Passed    0.03 sec

95% tests passed, 1 tests failed out of 21

Total Test time (real) = 438.50 sec

The following tests FAILED:
     10 - ExampleAdvancedPSDMads (Failed)
Errors while running CTest
make: *** [Makefile:139: test] Error 8

Test suite failed, dumping logs.
error: in phase 'check': uncaught exception:
%exception #<&invoke-error program: "make" arguments: ("test" "-j" "4") exit-status: 2 term-signal: #f stop-signal: #f> 
phase `check' failed after 438.6 seconds
command "make" "test" "-j" "4" failed with status 2

This issue is a duplicate of https://github.com/bbopt/nomad4dev/issues/711. I created it so that there is a public issue to reference in the comments of the Guix package for NOMAD (which disables this particular test). See https://issues.guix.gnu.org/55067.
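
For readers unfamiliar with why set -o pipefail exposes this failure: the log shows the test being run as `$executable $param_file | tee runTestLog.txt`, and by default bash reports only the exit status of the last command in a pipeline (here tee). The following is a minimal sketch of that behaviour, not the actual runExampleTest.sh; `failing_test` is a hypothetical stand-in for the aborted executable.

```shell
#!/usr/bin/env bash
# Hypothetical stand-in for "$executable $param_file": prints some output, then fails.
failing_test() {
    echo "BBE OBJ"
    return 134   # 128 + 6 (SIGABRT), the status bash reports for an aborted process
}

# Without pipefail, the pipeline's status is that of the last command (tee).
set +o pipefail
failing_test | tee /dev/null
status_without=$?

# With pipefail, the pipeline returns the rightmost non-zero status.
set -o pipefail
failing_test | tee /dev/null
status_with=$?

echo "without pipefail: $status_without"
echo "with pipefail:    $status_with"
```

Without the option the failure is masked by tee succeeding, which would explain why the test appeared to pass before the change.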

ctribes commented 2 years ago

This issue is related to OpenMP use. It is very difficult to debug.
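
Because the failure is transient, one way to gather evidence is to rerun the test until it fails and note how many runs that took. The sketch below is an illustration only: `run_until_fail` and `flaky` are hypothetical names, and `flaky` merely simulates a command that fails on its third invocation.

```shell
#!/usr/bin/env bash
# Run a command up to max_runs times and stop at the first failure.
# Prints which run failed and returns 1, or returns 0 if every run passed.
run_until_fail() {
    local max_runs=$1
    shift
    local i
    for ((i = 1; i <= max_runs; i++)); do
        if ! "$@" >/dev/null 2>&1; then
            echo "failed on run $i"
            return 1
        fi
    done
    echo "passed $max_runs runs"
    return 0
}

# Hypothetical stand-in for the flaky test: fails on its third invocation.
counter_file=$(mktemp)
echo 0 > "$counter_file"
flaky() {
    local n
    n=$(( $(cat "$counter_file") + 1 ))
    echo "$n" > "$counter_file"
    [ "$n" -ne 3 ]
}

result=$(run_until_fail 10 flaky)
echo "$result"
rm -f "$counter_file"
```

In the build tree, the same helper could wrap something like `ctest -R ExampleAdvancedPSDMads`; ctest 3.17 and later also provides a built-in `--repeat until-fail:<n>` option. Repeating the runs after `export OMP_NUM_THREADS=1` might indicate whether the failure depends on threading, assuming the example honours that variable rather than setting its thread count through its own parameters.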