adoptium / aqa-tests

Home of test infrastructure for Adoptium builds
https://adoptium.net/aqavit
Apache License 2.0
125 stars 304 forks source link

Auto rerun FAILURE state test jobs in test pipeline #5346

Open llxia opened 1 month ago

llxia commented 1 month ago

We support auto-rerun failed tests in the test pipeline (see #3431). It is useful for triaging and rerunning the failed tests automatically. However, this feature requires the test jobs to run to the completion (to get the failed test names). If the test jobs are in the FAILURE state (i.e., machine disconnected, machine out of space, etc), someone still needs to restart the jobs manually. This is very tedious during release time. We should have a feature to Auto rerun FAILURE state test jobs.

TRSS should support displaying the rerun test jobs. We should auto-archive the rerun jobs TAP into the parent job.

To simplify this, we will add this to the parent test job. This means a parent test job may trigger the following child test jobs:

FYI @pshipton @JasonFengJ9

smlambert commented 1 month ago

Please also see some of the recent changes done by @sophia-guo for archiving TAP files.

llxia commented 1 month ago

yes, all TAP files (child jobs, rerun failed tests, rerun FAILURE test jobs) should be archived.