adoptium / aqa-tests

Home of test infrastructure for Adoptium builds
https://adoptium.net/aqavit
Apache License 2.0

sparcv9_solaris crashes during jdk_jfr tests #4359

Open smlambert opened 1 year ago

smlambert commented 1 year ago

As reported in https://github.com/adoptium/aqa-tests/issues/4258#issuecomment-1398963195, a small set of jdk_jfr testcases, mostly under jdk/jfr/jcmd/*, occasionally core dump while executing jcmd. This appears to happen in cases where JFR.stop is used and needs further investigation. Example test run: https://ci.adoptium.net/job/Test_openjdk8_hs_extended.openjdk_sparcv9_solaris_testList_1/66/

jdk_jfr 
TEST: jdk/jfr/event/gc/stacktrace/TestG1OldAllocationPendingStackTrace.java
TEST: jdk/jfr/javaagent/TestLoadedAgent.java
TEST: jdk/jfr/jcmd/TestJcmdSaveToFile.java
TEST: jdk/jfr/jcmd/TestJcmdStartWithOptions.java
TEST: jdk/jfr/jcmd/TestJcmdStartWithSettings.java
TEST: jdk/jfr/jcmd/TestJcmdStopInvalidFile.java
TEST: jdk/jfr/jcmd/TestJcmdStopReadOnlyFile.java
Test results: passed: 411; failed: 6; error: 1
Running DCMD 'JFR.stop name="testDuration"' through 'PidJcmdExecutor'
Executing command '[/export/home/jenkins/workspace/Test_openjdk8_hs_extended.openjdk_sparcv9_solaris_testList_1/openjdkbinary/j2sdk-image/bin/jcmd, 20816, JFR.stop name="testDuration"]'
#
# A fatal error has been detected by the Java Runtime Environment:
#
#  SIGSEGV (0xb) at pc=0xffffffff7d769804, pid=20816, tid=0x0000000000000017
#
# JRE version: OpenJDK Runtime Environment (8.0_362-b09) (build 1.8.0_362-b09)
# Java VM: OpenJDK 64-Bit Server VM (25.362-b09 mixed mode solaris-sparc compressed oops)
# Problematic frame:
# V  [libjvm.so+0x769804]  void JfrNetworkInterfaceName::serialize(JfrCheckpointWriter&)+0xec
#
# Core dump written. Default location: /export/home/jenkins/workspace/Test_openjdk8_hs_extended.openjdk_sparcv9_solaris_testList_1/aqa-tests/TKG/output_16744972695799/jdk_jfr_0/work/scratch/core or core.20816
#
# An error report file with more information is saved as:
# /export/home/jenkins/workspace/Test_openjdk8_hs_extended.openjdk_sparcv9_solaris_testList_1/aqa-tests/TKG/output_16744972695799/jdk_jfr_0/work/scratch/hs_err_pid20816.log
#
# If you would like to submit a bug report, please visit:
#   https://github.com/adoptium/adoptium-support/issues
#

Additional results, logs, and core files from one of the failing testcases, TestJcmdSaveToFile: hs_err_pid20704.log TestJcmdSaveToFile.jtr.txt
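
To check whether another failure hits the same problematic frame, the header of the error report can be reduced to a small signature and compared across runs. The following is a minimal Python sketch, assuming the header layout shown in the log above. The function name `crash_signature`, the `SAMPLE` constant, and the regular expressions are illustrative and are not part of aqa-tests or TKG.

```python
import re

# Header lines copied from the crash output in this issue.
SAMPLE = """\
#
# A fatal error has been detected by the Java Runtime Environment:
#
#  SIGSEGV (0xb) at pc=0xffffffff7d769804, pid=20816, tid=0x0000000000000017
#
# Problematic frame:
# V  [libjvm.so+0x769804]  void JfrNetworkInterfaceName::serialize(JfrCheckpointWriter&)+0xec
#
"""

# "#  SIGSEGV (0xb) at pc=0x..., pid=20816, tid=0x..."
SIGNAL_RE = re.compile(
    r"^#\s+(SIG\w+)\s+\(0x[0-9a-fA-F]+\)\s+at pc=(0x[0-9a-fA-F]+),\s+pid=(\d+)"
)

# "# V  [libjvm.so+0x769804]  <symbol>+0xec"
FRAME_RE = re.compile(
    r"^#\s+(\w)\s+\[(.+?)\+(0x[0-9a-fA-F]+)\]\s+(.*?)(?:\+0x[0-9a-fA-F]+)?$"
)


def crash_signature(log_text):
    """Return a dict describing the signal and problematic frame, if found."""
    signature = {}
    expect_frame = False
    for raw_line in log_text.splitlines():
        line = raw_line.strip()

        signal_match = SIGNAL_RE.match(line)
        if signal_match:
            signature["signal"] = signal_match.group(1)
            signature["pid"] = int(signal_match.group(3))
            continue

        if line.startswith("# Problematic frame:"):
            # The frame itself is printed on the next line.
            expect_frame = True
            continue

        if expect_frame:
            frame_match = FRAME_RE.match(line)
            if frame_match:
                signature["frame_type"] = frame_match.group(1)
                signature["library"] = frame_match.group(2)
                signature["offset"] = frame_match.group(3)
                signature["symbol"] = frame_match.group(4)
            expect_frame = False

    return signature


if __name__ == "__main__":
    for key, value in crash_signature(SAMPLE).items():
        print(f"{key}: {value}")
```

Two crashes whose `library` and `symbol` values match are good candidates for being the same underlying problem, even when the pid and the triggering test differ.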

adamfarley commented 1 year ago

We've seen a near-identical issue in TestGetEventTypes.java (as detailed here). Here's the information:

Core Facts

Platform: Solaris SparcV9
JDK Version: JDK8u382
Test Super Target: extended.OpenJDK
Test Target: jdk_jfr_0
Test: jdk/jfr/api/flightrecorder/TestGetEventTypes.java
Link: https://ci.adoptium.net/job/Test_openjdk8_hs_extended.openjdk_sparcv9_solaris_testList_1/81/consoleFull

Additional Data

I've raised an upstream issue for this here.

Useful files

Files can be found here.

This Google Drive folder includes log files, full test output, and a core file.

Next action

I plan to exclude this test as well.

Update: PR raised.