adoptium / infrastructure

This repo contains all information about machine maintenance.
Apache License 2.0
84 stars 101 forks source link

Differences in runtime between build and test sparcv9 boxes #2729

Open Haroon-Khel opened 1 year ago

Haroon-Khel commented 1 year ago

The test job https://ci.adoptopenjdk.net/job/Test_openjdk8_hs_extended.openjdk_sparcv9_solaris_testList_1/ will run for 9h+ (and get aborted) on build-siteox-solaris10u11-sparcv9-1 and 4h on test-siteox-solaris10u11-sparcv9-1

Haroon-Khel commented 1 year ago
bash-3.2# psrinfo -pv
The physical processor has 4 virtual processors (0-3)
  UltraSPARC-T2+ (chipid 0, clock 1415 MHz)

Both machines have 4 virtual processors

zdtsw commented 1 year ago

so the test case "hotspot_jre_0_FAILED" failed everytime when run on build-siteox-solaris10u11-sparcv9-1

2022-09-03T15:34:54.606Z] TEST RESULT: Error. Program `/export/home/jenkins/workspace/Test_openjdk8_hs_extended.openjdk_sparcv9_solaris_testList_1/openjdkbinary/j2sdk-image/bin/java' timed out (timeout set to 2400000ms, elapsed time including timeout handling was 2401769ms).
[2022-09-03T15:34:54.606Z] --------------------------------------------------
[2022-09-03T16:24:42.542Z] Test results: passed: 711; error: 1

and looks like most of the tests run on this machine takes longer time than on the other one

sxa commented 1 year ago

FYI as discussed @steelhead31

sxa commented 1 year ago

@Haroon-Khel Can you summarise the data in here - I believe you've been running comparable openjdk runs on the TC machine too now as a third machine/data point for that comparison.

Haroon-Khel commented 1 year ago

Annoyingly the extended job that I ran on our TC sparc solaris machine did not run smoothly. Every test failed with an error complaining about not being able to find the test file. I'll kick it off again.

Haroon-Khel commented 1 year ago

So far the differences in runtime have been from the https://ci.adoptopenjdk.net/job/Test_openjdk8_hs_extended.openjdk_sparcv9_solaris_testList_1/ job. I'm interested to see if the runtimes differ if the extended openjdk job is run in a grinder instead

https://ci.adoptopenjdk.net/view/Test_grinder/job/Grinder/5682/console build-siteox-solaris10u11-sparcv9-1 https://ci.adoptopenjdk.net/view/Test_grinder/job/Grinder/5683/console test-siteox-solaris10u11-sparcv9-1

Haroon-Khel commented 1 year ago

May have found the difference

Build machine

bash-3.2# /usr/sbin/psrinfo -v
Status of virtual processor 0 as of: 09/11/2022 23:42:23
  on-line since 05/13/2022 08:09:57.
  The sparcv9 processor operates at 1415 MHz,
        and has a sparcv9 floating point processor.
Status of virtual processor 1 as of: 09/11/2022 23:42:23
  on-line since 05/13/2022 08:09:59.
  The sparcv9 processor operates at 1415 MHz,
        and has a sparcv9 floating point processor.
Status of virtual processor 2 as of: 09/11/2022 23:42:23
  on-line since 05/13/2022 08:09:59.
  The sparcv9 processor operates at 1415 MHz,
        and has a sparcv9 floating point processor.
Status of virtual processor 3 as of: 09/11/2022 23:42:23
  on-line since 05/13/2022 08:09:59.
  The sparcv9 processor operates at 1415 MHz,
        and has a sparcv9 floating point processor.

Test machine

bash-3.2# /usr/sbin/psrinfo -v
Status of virtual processor 0 as of: 09/12/2022 09:02:26
  on-line since 01/09/2022 21:46:53.
  The sparcv9 processor operates at 3600 MHz,
        and has a sparcv9 floating point processor.
Status of virtual processor 1 as of: 09/12/2022 09:02:26
  on-line since 01/09/2022 21:47:38.
  The sparcv9 processor operates at 3600 MHz,
        and has a sparcv9 floating point processor.
Status of virtual processor 2 as of: 09/12/2022 09:02:26
  on-line since 01/09/2022 21:47:38.
  The sparcv9 processor operates at 3600 MHz,
        and has a sparcv9 floating point processor.
Status of virtual processor 3 as of: 09/12/2022 09:02:26
  on-line since 01/09/2022 21:47:38.
  The sparcv9 processor operates at 3600 MHz,
        and has a sparcv9 floating point processor.

The test machine has more than double the clock speed for each of its cpus compared to the build machine, which makes sense since the runtime on the build machine is double that of the test machine. @sxa Can we look at getting build-siteox-solaris10u11-sparcv9-1 upgraded?