dragonwell-releng / dragonwell-adoptium

【dragonwell11】On x86, vmTestbase/nsk/monitoring/stress/lowmem/lowmem007/TestDescription.java fails about 17% of the time: User specified action: run main/othervm -XX:-UseGCOverheadLimit nsk.monitoring.stress.lowmem.lowmem001 -memory=mixed #324

Closed: owanqian closed this issue 8 months ago

owanqian commented 8 months ago

https://tone.aliyun-inc.com/ws/xesljfzh/test_result/263375

【Environment setup】

wget -O binary.tar.gz  https://dragonwell.oss-cn-shanghai.aliyuncs.com/11.0.21.18.9-test/Alibaba_Dragonwell_Extended_11.0.21.18.9_x64_linux.tar.gz
wget  -O test-image.tar.gz https://dragonwell.oss-cn-shanghai.aliyuncs.com/11.0.21.18.9-test/Alibaba_Dragonwell_Extended_11.0.21.18.9_x64_linux-testimage.tar.gz

mkdir binary-download && mkdir test-image

tar xzvf binary.tar.gz -C binary-download
tar xzvf test-image.tar.gz -C test-image

cd binary-download && export JAVA_HOME=$PWD ; export PATH=$JAVA_HOME/bin:$PATH ; export TEST_JDK_HOME=$JAVA_HOME && cd -

wget -O jtreg.zip http://114.55.64.175:8666/compiler-ci-bucket/tools/jtreg-6.1.1.zip
unzip jtreg.zip
cd jtreg 
export JT_HOME=$PWD ; export PATH=$PWD/bin:$PATH
cd -

git clone https://github.com/dragonwell-project/dragonwell11.git -b dragonwell_extended-11.0.21.18.9 jdk-repo

native=-nativepath:./test-image/hotspot/jtreg/native
args='-XX:+UseCompactObjectHeaders'
test=vmTestbase/nsk/monitoring/stress/lowmem/lowmem007/TestDescription.java

jtreg  -nr -v:fail,error -w tmp  $args $native $test
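Because the failure is intermittent, a single run of the command above is unlikely to be conclusive. A minimal sketch of a repeat loop follows, assuming a POSIX shell and assuming jtreg exits with a nonzero status when the test fails. `run_repeated` is a hypothetical helper, not part of jtreg or Tone, and it only counts nonzero exits of whatever command it is given.

```shell
# Hypothetical helper: run a command N times and count nonzero exits.
run_repeated() {
  n=$1; shift
  i=0; failed=0
  while [ "$i" -lt "$n" ]; do
    # Output is discarded here; redirect to per-run files if logs are needed.
    "$@" >/dev/null 2>&1 || failed=$((failed + 1))
    i=$((i + 1))
  done
  echo "runs=$n failed=$failed"
}
```

It could be invoked as `run_repeated 100 jtreg -nr -v:fail,error -w tmp $args $native $test`.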

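Manual reproduction on a test machine with the same configuration did not reproduce the failure. On Tone, 508 of 3000 runs failed: 359 with exit code 137 and 149 with a timeout. https://tone.aliyun-inc.com/ws/xesljfzh/test_result/264355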
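The 17% in the title follows from these counts. A small sketch of the arithmetic, using only the numbers reported above (the variable names are illustrative):

```shell
# Counts taken from the Tone run reported above.
total=3000
code137=359
timeouts=149

failed=$((code137 + timeouts))
# awk does the floating-point division that POSIX shell arithmetic lacks.
rate=$(awk -v f="$failed" -v t="$total" 'BEGIN { printf "%.1f", 100 * f / t }')
echo "failed=$failed rate=${rate}%"   # prints: failed=508 rate=16.9%
```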

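【Test case logs】 Three kinds of error logs were seen during testing: 134code is the log from the first failure in the Tone run; 137code and timeout are the logs seen during the 3000 runs of this single test. Attachments: 137code.txt 134code.txt timeout.txt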
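Exit codes above 128 conventionally mean the process was ended by a signal, with the signal number being the code minus 128. Under that convention 134 corresponds to SIGABRT, which fits a JVM aborting after a fatal error, and 137 corresponds to SIGKILL. What sent the SIGKILL is not stated in this report. A minimal sketch, assuming a Linux shell whose `kill -l` accepts a signal number; `signal_of` is a hypothetical helper name:

```shell
# Hypothetical helper: map an exit status above 128 to a signal name.
signal_of() {
  code=$1
  if [ "$code" -gt 128 ]; then
    kill -l "$((code - 128))"
  else
    echo "none"
  fi
}

signal_of 134   # ABRT on Linux (signal 6)
signal_of 137   # KILL on Linux (signal 9)
```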

ACTION: main -- Failed. Unexpected exit from test [exit code: 134]
REASON: User specified action: run main/othervm -XX:-UseGCOverheadLimit nsk.monitoring.stress.lowmem.lowmem001 -memory=mixed 
TIME:   80.687 seconds
messages:
command: main -XX:-UseGCOverheadLimit nsk.monitoring.stress.lowmem.lowmem001 -memory=mixed
reason: User specified action: run main/othervm -XX:-UseGCOverheadLimit nsk.monitoring.stress.lowmem.lowmem001 -memory=mixed 
Mode: othervm [/othervm specified]
elapsed time (seconds): 80.687
configuration:
STDOUT:
Stress time: 60 seconds
Stress iterations factor: 1
Stress threads factor: 1
Stress runs factor: 1
Max memory: 16257122304
Sleep time: 500
Iterations: 0
Number of threads: 64
Seed: 1703848890891
Run GC thread: false
Run mem diag thread: false
Run forever: false
Starting Thread[Thread[Thread-2,5,MainThreadGroup],5,MainThreadGroup]
MemoryMonitor   > Changing threshold, pool: CodeHeap 'non-nmethods', nullnull, new threshold: 3604479
MemoryMonitor   > Usage threshold is set, pool: sun.management.MemoryPoolImpl@266a30, usage: init = 2555904(2496K) used = 1205120(1176K) committed = 3604480(3520K) max = 8196096(8004K), threshold: 3604479
MemoryMonitor   > Changing threshold, pool: Metaspace, nullnull, new threshold: 1268508738
MemoryMonitor   > Usage threshold is set, pool: sun.management.MemoryPoolImpl@6f0a81, usage: init = 0(0K) used = 429306160(419244K) committed = 1174265856(1146744K) max = -1(-1K), threshold: 1268508738
MemoryMonitor   > Changing threshold, pool: CodeHeap 'profiled nmethods', nullnull, new threshold: 3578368
MemoryMonitor   > Usage threshold is set, pool: sun.management.MemoryPoolImpl@17a33b8, usage: init = 2555904(2496K) used = 3552256(3469K) committed = 3604480(3520K) max = 121729024(118876K), threshold: 3578368
MemoryMonitor   > Changing threshold, pool: Compressed Class Space, nullnull, new threshold: 871440736
MemoryMonitor   > Usage threshold is set, pool: sun.management.MemoryPoolImpl@1e2d461, usage: init = 0(0K) used = 236548272(231004K) committed = 871497728(851072K) max = 1073741824(1048576K), threshold: 871440736
MemoryMonitor   > Changing threshold, pool: G1 Old Gen, nullnull, new threshold: 4077690416
MemoryMonitor   > Usage threshold is set, pool: sun.management.MemoryPoolImpl@15f9a1a, usage: init = 964689920(942080K) used = 1801010272(1758799K) committed = 6354370560(6205440K) max = 16257122304(15876096K), threshold: 4077690416
MemoryMonitor   > Changing threshold, pool: CodeHeap 'non-profiled nmethods', nullnull, new threshold: 1996352
MemoryMonitor   > Usage threshold is set, pool: sun.management.MemoryPoolImpl@1b4b8f2, usage: init = 2555904(2496K) used = 1436800(1403K) committed = 2555904(2496K) max = 121733120(118880K), threshold: 1996352
MemoryMonitor   > 1600 notification "Memory usage exceeds usage threshold" is caught on Fri Dec 29 19:22:00 CST 2023 by G1 Old Gen (init = 964689920(942080K) used = 4364173312(4261888K) committed = 7260340224(7090176K) max = 16257122304(15876096K))
MemoryMonitor   > Changing threshold, pool: CodeHeap 'non-nmethods', nullnull, new threshold: 3604479
MemoryMonitor   > Usage threshold is set, pool: sun.management.MemoryPoolImpl@266a30, usage: init = 2555904(2496K) used = 1205120(1176K) committed = 3604480(3520K) max = 8196096(8004K), threshold: 3604479
MemoryMonitor   > Changing threshold, pool: Metaspace, nullnull, new threshold: 1259922465
MemoryMonitor   > Usage threshold is set, pool: sun.management.MemoryPoolImpl@6f0a81, usage: init = 0(0K) used = 675215888(659390K) committed = 1251336192(1222008K) max = -1(-1K), threshold: 1259922465
MemoryMonitor   > Changing threshold, pool: CodeHeap 'profiled nmethods', nullnull, new threshold: 3578368
MemoryMonitor   > Usage threshold is set, pool: sun.management.MemoryPoolImpl@17a33b8, usage: init = 2555904(2496K) used = 3552256(3469K) committed = 3604480(3520K) max = 121729024(118876K), threshold: 3578368
MemoryMonitor   > Changing threshold, pool: Compressed Class Space, nullnull, new threshold: 871469232
MemoryMonitor   > Usage threshold is set, pool: sun.management.MemoryPoolImpl@1e2d461, usage: init = 0(0K) used = 374945600(366157K) committed = 871497728(851072K) max = 1073741824(1048576K), threshold: 871469232
MemoryMonitor   > Changing threshold, pool: G1 Old Gen, nullnull, new threshold: 5812256768
MemoryMonitor   > Usage threshold is set, pool: sun.management.MemoryPoolImpl@15f9a1a, usage: init = 964689920(942080K) used = 4364173312(4261888K) committed = 7260340224(7090176K) max = 16257122304(15876096K), threshold: 5812256768
MemoryMonitor   > Changing threshold, pool: CodeHeap 'non-profiled nmethods', nullnull, new threshold: 1996352
MemoryMonitor   > Usage threshold is set, pool: sun.management.MemoryPoolImpl@1b4b8f2, usage: init = 2555904(2496K) used = 1436800(1403K) committed = 2555904(2496K) max = 121733120(118880K), threshold: 1996352
MemoryMonitor   > 1601 notification "Memory usage exceeds usage threshold" is caught on Fri Dec 29 19:22:01 CST 2023 by G1 Old Gen (init = 964689920(942080K) used = 6325010432(6176768K) committed = 8317304832(8122368K) max = 16257122304(15876096K))
MemoryMonitor   > Changing threshold, pool: CodeHeap 'non-nmethods', nullnull, new threshold: 3604479
MemoryMonitor   > Usage threshold is set, pool: sun.management.MemoryPoolImpl@266a30, usage: init = 2555904(2496K) used = 1205120(1176K) committed = 3604480(3520K) max = 8196096(8004K), threshold: 3604479
MemoryMonitor   > Changing threshold, pool: Metaspace, nullnull, new threshold: 1280926224
MemoryMonitor   > Usage threshold is set, pool: sun.management.MemoryPoolImpl@6f0a81, usage: init = 0(0K) used = 766182104(748224K) committed = 1301929984(1271416K) max = -1(-1K), threshold: 1280926224
MemoryMonitor   > Changing threshold, pool: CodeHeap 'profiled nmethods', nullnull, new threshold: 3578368
MemoryMonitor   > Usage threshold is set, pool: sun.management.MemoryPoolImpl@17a33b8, usage: init = 2555904(2496K) used = 3552256(3469K) committed = 3604480(3520K) max = 121729024(118876K), threshold: 3578368
MemoryMonitor   > Changing threshold, pool: Compressed Class Space, nullnull, new threshold: 871483480
MemoryMonitor   > Usage threshold is set, pool: sun.management.MemoryPoolImpl@1e2d461, usage: init = 0(0K) used = 426152144(416164K) committed = 871497728(851072K) max = 1073741824(1048576K), threshold: 871483480
MemoryMonitor   > Changing threshold, pool: G1 Old Gen, nullnull, new threshold: 7321157632
MemoryMonitor   > Usage threshold is set, pool: sun.management.MemoryPoolImpl@15f9a1a, usage: init = 964689920(942080K) used = 6325010432(6176768K) committed = 8317304832(8122368K) max = 16257122304(15876096K), threshold: 7321157632
MemoryMonitor   > Changing threshold, pool: CodeHeap 'non-profiled nmethods', nullnull, new threshold: 1996352
MemoryMonitor   > Usage threshold is set, pool: sun.management.MemoryPoolImpl@1b4b8f2, usage: init = 2555904(2496K) used = 1436800(1403K) committed = 2555904(2496K) max = 121733120(118880K), threshold: 1996352
#
# A fatal error has been detected by the Java Runtime Environment:
#
#  SIGSEGV (0xb) at pc=0x00007f6f227a53df, pid=2198740, tid=2198848
#
# JRE version: OpenJDK Runtime Environment (Alibaba Dragonwell Extended Edition)-11.0.21.18+9-GA (11.0.21.18+9) (build 11.0.21.18+9)
# Java VM: OpenJDK 64-Bit Server VM (Alibaba Dragonwell Extended Edition)-11.0.21.18+9-GA (11.0.21.18+9, mixed mode, tiered, compressed oops, g1 gc, linux-amd64)
# Problematic frame:
# V  [libjvm.so+0x7a53df]  G1ParEvacuateFollowersClosure::do_void()+0x47f
#
# Core dump will be written. Default location: /tmp/tone/run/jtreg/jt-work/hotspot_jtreg/vmTestbase/nsk/monitoring/stress/lowmem/lowmem007/TestDescription/core.2198740
#
# An error report file with more information is saved as:
# /tmp/tone/run/jtreg/jt-work/hotspot_jtreg/vmTestbase/nsk/monitoring/stress/lowmem/lowmem007/TestDescription/hs_err_pid2198740.log
#
# If you would like to submit a bug report, please visit:
#   mailto:dragonwell_use@googlegroups.com
#
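When comparing many crash logs, it can help to pull out only the problematic frame. A minimal sketch, assuming the fatal-error header always has the layout shown above, with the frame on the line right after `# Problematic frame:`. `problematic_frame` is a hypothetical helper name; it reads the files given as arguments, or standard input if none are given.

```shell
# Hypothetical helper: print the line following "# Problematic frame:",
# with the leading "# " stripped.
problematic_frame() {
  sed -n '/^# Problematic frame:/{n;s/^# *//;p;}' "$@"
}
```

For example, `problematic_frame hs_err_pid2198740.log` would print the `G1ParEvacuateFollowersClosure::do_void()` frame for the crash above.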

【Version information】

[root@iZbp1j2iu8t6nrjesrw8fhZ jtreg]# uname -a ; cat /etc/os-release ; free -h; lscpu| head -n 25;java -version; java -Xinternalversion
Linux iZbp1j2iu8t6nrjesrw8fhZ 5.10.134-15.an8.x86_64 #1 SMP Thu Jul 20 00:35:47 CST 2023 x86_64 x86_64 x86_64 GNU/Linux
NAME="Anolis OS"
VERSION="8.8"
ID="anolis"
ID_LIKE="rhel fedora centos"
VERSION_ID="8.8"
PLATFORM_ID="platform:an8"
PRETTY_NAME="Anolis OS 8.8"
ANSI_COLOR="0;31"
HOME_URL="https://openanolis.cn/"

              total        used        free      shared  buff/cache   available
Mem:           60Gi       765Mi        58Gi       2.0Mi       1.0Gi        59Gi
Swap:            0B          0B          0B
Architecture:        x86_64
CPU op-mode(s):      32-bit, 64-bit
Byte Order:          Little Endian
CPU(s):              32
On-line CPU(s) list: 0-31
Thread(s) per core:  2
Core(s) per socket:  16
Socket(s):           1
NUMA node(s):        1
Vendor ID:           GenuineIntel
BIOS Vendor ID:      Alibaba Cloud
CPU family:          6
Model:               106
Model name:          Intel(R) Xeon(R) Platinum 8369B CPU @ 2.70GHz
BIOS Model name:     pc-i440fx-2.1
Stepping:            6
CPU MHz:             3516.555
BogoMIPS:            5399.99
Hypervisor vendor:   KVM
Virtualization type: full
L1d cache:           48K
L1i cache:           32K
L2 cache:            1280K
L3 cache:            49152K
NUMA node0 CPU(s):   0-31
openjdk version "11.0.21.18" 2023-10-17
OpenJDK Runtime Environment (Alibaba Dragonwell Extended Edition)-11.0.21.18+9-GA (build 11.0.21.18+9)
OpenJDK 64-Bit Server VM (Alibaba Dragonwell Extended Edition)-11.0.21.18+9-GA (build 11.0.21.18+9, mixed mode)
OpenJDK 64-Bit Server VM (11.0.21.18+9) for linux-amd64 JRE (11.0.21.18+9), built on Dec 28 2023 07:13:54 by "" with gcc 7.5.0

sendaoYan commented 8 months ago

Tracked here: https://github.com/dragonwell-project/dragonwell11/issues/762