eclipse-openj9 / openj9

Eclipse OpenJ9: A Java Virtual Machine for OpenJDK that's optimized for small footprint, fast start-up, and high throughput. Builds on Eclipse OMR (https://github.com/eclipse/omr) and combines with the Extensions for OpenJDK for OpenJ9 repo.
Other
3.28k stars 721 forks source link

CRIU pie: 19618: Error (criu/pie/restorer.c:1839): prctl failed @1839 with -1 #19211

Open JasonFengJ9 opened 8 months ago

JasonFengJ9 commented 8 months ago

Failure link

From an internal build(sles12s390x-svl-rt3-1):

openjdk version "17.0.11" 2024-04-16
IBM Semeru Runtime Open Edition 17.0.11.0-m1 (build 17.0.11+6)
Eclipse OpenJ9 VM 17.0.11.0-m1 (build v0.44.0-release-747f86c91, JRE 17 Linux s390x-64-Bit Compressed References 20240416_666 (JIT enabled, AOT enabled)
OpenJ9   - 747f86c91
OMR      - 254af5a04
JCL      - c86cd885d3f based on jdk-17.0.11+6)

Rerun in Grinder - Change TARGET to run only the failed test targets.

Optional info

Failure output (captured from console output)

[2024-03-19T14:56:57.422Z] variation: -Xjit -XX:+CRIURestoreNonPortableMode
[2024-03-19T14:56:57.422Z] JVM_OPTIONS:  -Xjit -XX:+CRIURestoreNonPortableMode 

[2024-03-19T14:57:55.484Z] Testing: Create and Restore Criu Checkpoint Image once - TestSingleThreadModeRestoreException
[2024-03-19T14:57:55.484Z] Test start time: 2024/03/19 07:57:55 Pacific Standard Time
[2024-03-19T14:57:55.484Z] Running command: bash /home/jenkins/workspace/Test_openjdk17_j9_sanity.functional_s390x_linux_testList_1/aqa-tests/TKG/../../jvmtest/functional/cmdLineTests/criu/criuScript.sh /home/jenkins/workspace/Test_openjdk17_j9_sanity.functional_s390x_linux_testList_1/aqa-tests/TKG/../../jvmtest/functional/cmdLineTests/criu /home/jenkins/workspace/Test_openjdk17_j9_sanity.functional_s390x_linux_testList_1/jdkbinary/j2sdk-image/bin/java " -Xjit -XX:+CRIURestoreNonPortableMode " org.openj9.criu.TestSingleThreadModeRestoreException 1 1 false false
[2024-03-19T14:57:55.484Z] Time spent starting: 2 milliseconds
[2024-03-19T14:57:59.023Z] Time spent executing: 2878 milliseconds
[2024-03-19T14:57:59.023Z] Test result: FAILED
[2024-03-19T14:57:59.023Z] Output from test:
[2024-03-19T14:57:59.023Z]  [OUT] start running script
[2024-03-19T14:57:59.023Z]  [OUT] export GLIBC_TUNABLES=glibc.cpu.hwcaps=-XSAVEC,-XSAVE,-AVX2,-ERMS,-AVX,-AVX_Fast_Unaligned_Load
[2024-03-19T14:57:59.023Z]  [OUT] export LD_BIND_NOT=on
[2024-03-19T14:57:59.023Z]  [OUT] /home/jenkins/workspace/Test_openjdk17_j9_sanity.functional_s390x_linux_testList_1/jdkbinary/j2sdk-image/bin/java -XX:+EnableCRIUSupport  -Xjit -XX:+CRIURestoreNonPortableMode  -cp /home/jenkins/workspace/Test_openjdk17_j9_sanity.functional_s390x_linux_testList_1/aqa-tests/TKG/../../jvmtest/functional/cmdLineTests/criu/criu.jar org.openj9.criu.TestSingleThreadModeRestoreException 1 1
[2024-03-19T14:57:59.023Z]  [OUT] main: Tue Mar 19 07:57:55 PDT 2024, testSingleThreadModeRestoreExceptionSynLock() before synchronized on java.lang.Object@5e8f9e7b, System.currentTimeMillis(): 1710860275584, System.nanoTime(): 1710860164144366258
[2024-03-19T14:57:59.023Z]  [OUT] main: Tue Mar 19 07:57:55 PDT 2024, testSingleThreadModeRestoreExceptionSynLock() before doCheckpointSynLock(), System.currentTimeMillis(): 1710860275751, System.nanoTime(): 1710860164311600027
[2024-03-19T14:57:59.023Z]  [OUT] Pre-checkpoint
[2024-03-19T14:57:59.023Z]  [OUT] Thread-0: Tue Mar 19 07:57:55 PDT 2024, Performing CRIUSupport.checkpointJVM(), System.currentTimeMillis(): 1710860275754, System.nanoTime(): 1710860164314792055
[2024-03-19T14:57:59.023Z]  [OUT] initiate restore
[2024-03-19T14:57:59.023Z]  [OUT] pie: 19618: Error (criu/pie/restorer.c:1839): prctl failed @1839 with -1
[2024-03-19T14:57:59.023Z]  [OUT] pie: 19618: Error (criu/pie/restorer.c:1840): prctl failed @1840 with -1
[2024-03-19T14:57:59.023Z]  [OUT] pie: 19618: Error (criu/pie/restorer.c:1841): prctl failed @1841 with -1
[2024-03-19T14:57:59.023Z]  [OUT] pie: 19618: Error (criu/pie/restorer.c:1842): prctl failed @1842 with -1
[2024-03-19T14:57:59.023Z]  [OUT] pie: 19618: Error (criu/pie/restorer.c:1843): prctl failed @1843 with -1
[2024-03-19T14:57:59.023Z]  [OUT] pie: 19618: Error (criu/pie/restorer.c:1844): prctl failed @1844 with -1
[2024-03-19T14:57:59.023Z]  [OUT] pie: 19618: Error (criu/pie/restorer.c:1845): prctl failed @1845 with -1
[2024-03-19T14:57:59.023Z]  [OUT] pie: 19618: Error (criu/pie/restorer.c:1846): prctl failed @1846 with -1
[2024-03-19T14:57:59.023Z]  [OUT] pie: 19618: Error (criu/pie/restorer.c:1847): prctl failed @1847 with -1
[2024-03-19T14:57:59.023Z]  [OUT] pie: 19618: Error (criu/pie/restorer.c:1848): prctl failed @1848 with -1
[2024-03-19T14:57:59.023Z]  [OUT] pie: 19618: Error (criu/pie/restorer.c:1849): prctl failed @1849 with -1
[2024-03-19T14:57:59.023Z]  [OUT] pie: 19618: Error (criu/pie/restorer.c:1850): prctl failed @1850 with -1
[2024-03-19T14:57:59.023Z]  [OUT] pie: 19618: Error (criu/pie/restorer.c:777): prctl failed @777 with -1
[2024-03-19T14:57:59.023Z]  [OUT] pie: 19618: Error (criu/pie/restorer.c:779): Can't restore EXE link (-1)
[2024-03-19T14:57:59.023Z]  [OUT] pie: 19618: Error (criu/pie/restorer.c:2102): Restorer fail 19618
[2024-03-19T14:57:59.023Z]  [OUT] Error (criu/cr-restore.c:2547): Restoring FAILED.
[2024-03-19T14:57:59.023Z]  [OUT] Removed test output files
[2024-03-19T14:57:59.023Z]  [OUT] finished script
[2024-03-19T14:57:59.023Z]  [ERR] /home/jenkins/workspace/Test_openjdk17_j9_sanity.functional_s390x_linux_testList_1/aqa-tests/TKG/../../jvmtest/functional/cmdLineTests/criu/criuScript.sh: line 41: 19618 Killed                  $2 -XX:+EnableCRIUSupport $3 -cp "$1/criu.jar" $4 $5 $6 > testOutput 2>&1
[2024-03-19T14:57:59.023Z] >> Success condition was not found: [Output match: testSingleThreadModeRestoreExceptionJUCLock: PASSED]
[2024-03-19T14:57:59.023Z] >> Success condition was not found: [Output match: testSingleThreadModeRestoreExceptionSynLock: PASSED]
[2024-03-19T14:57:59.023Z] >> Required condition was found: [Output match: Pre-checkpoint]
[2024-03-19T14:57:59.023Z] >> Failure condition was not found: [Output match: testSingleThreadModeRestoreExceptionJUCLock: FAILED]
[2024-03-19T14:57:59.023Z] >> Failure condition was not found: [Output match: testSingleThreadModeRestoreExceptionSynLock: FAILED]
[2024-03-19T14:57:59.023Z] >> Required condition was found: [Output match: Killed]
[2024-03-19T14:57:59.023Z] >> Failure condition was not found: [Output match: CRIU is not enabled]
[2024-03-19T14:57:59.023Z] >> Failure condition was not found: [Output match: Operation not permitted]
[2024-03-19T14:57:59.023Z] >> Success condition was not found: [Output match: Thread pid mismatch]
[2024-03-19T14:57:59.023Z] >> Success condition was not found: [Output match: do not match expected]
[2024-03-19T14:57:59.023Z] >> Success condition was not found: [Output match: Unable to create a thread:]
[2024-03-19T14:57:59.023Z] >> Failure condition was not found: [Output match: Could not dump the JVM processes, err=-70]

[2024-03-19T15:00:10.749Z] ---TEST RESULTS---
[2024-03-19T15:00:10.749Z] Number of PASSED tests: 56 out of 64
[2024-03-19T15:00:10.749Z] Number of FAILED tests: 8 out of 64
[2024-03-19T15:00:10.749Z] 
[2024-03-19T15:00:10.749Z] ---SUMMARY OF FAILED TESTS---
[2024-03-19T15:00:10.749Z] Create and Restore Criu Checkpoint Image once - TestSingleThreadModeRestoreException
[2024-03-19T15:00:10.749Z] Create and Restore Criu Checkpoint Image once - MethodTypeDeadlockTest
[2024-03-19T15:00:10.749Z] Create and Restore Criu Checkpoint Image once - TestDelayedOperations
[2024-03-19T15:00:10.749Z] Envvar test6
[2024-03-19T15:00:10.749Z] Create Criu Checkpoint Image once and no restore - TestConcurrentModePreCheckpointHookPriorities
[2024-03-19T15:00:10.749Z] Create and Restore Criu Checkpoint Image once - TestConcurrentModePostRestoreHookThrowException
[2024-03-19T15:00:10.749Z] Create and Restore Criu Checkpoint Image once - TestConcurrentModePostRestoreHookRunOnce
[2024-03-19T15:00:10.749Z] Create and Restore Criu Checkpoint Image once - TestConcurrentModePostRestoreHookPriorities
[2024-03-19T15:00:10.749Z] -----------------------------
[2024-03-19T15:00:10.749Z] 
[2024-03-19T15:00:10.749Z] -----------------------------------
[2024-03-19T15:00:10.749Z] cmdLineTester_criu_nonPortableRestore_0_FAILED

50x internal grinder - https://github.com/eclipse-openj9/openj9/issues/19211#issuecomment-2015027681

FYI @tajila @ymanton

pshipton commented 8 months ago

A machine issue or sles12 unsupported?

Grinder Passed 90/90 on rhel9s390x-svl-rt3-1, rhel8s390x-svl-rt1-1 Failed 1/90 on ubu22s390x-rt-2 in testSystemNanoTimeJitPreCheckpointCompile Could not dump the JVM processes, err=-70 Failed 90/90 on sles12s390x-svl-rt6-1 in many tests with prctl failed Failed 1/90 on ubu20s390x-svl-rt6-1 in Envvar test6, failed to restore

JasonFengJ9 commented 8 months ago

A machine issue or sles12 unsupported?

sles12 is unsupported as per https://github.com/eclipse-openj9/openj9/pull/19179/files

pshipton commented 8 months ago

Jason the link shows sles 15+, did I miss something?

pshipton commented 8 months ago

Ah, nm I see you updated your comment.