
:imagesdir: docs/images
:toc:
:toc-placement!:
:figure-caption!:
:caption!:

= Test Retry Gradle plugin

A Gradle plugin that augments Gradle’s built-in test task with the ability to retry tests that have failed.



== What it does

The plugin causes failed tests to be retried within the same task. After executing all tests, any failed tests are retried. The process repeats with tests that continue to fail until the maximum specified number of retries has been attempted, or there are no more failing tests.

By default, the test task does not fail if every failed test passes on retry, so flaky tests do not cause build failure. This setting can be changed so that flaky tests fail the build, which can be used to identify them.

When something goes badly wrong and all tests start failing, it can be preferable not to keep retrying them. This can happen, for example, if a disk fills up or a required database is not available. For such cases, the plugin can be configured to stop retrying after a certain number of total test failures.

NOTE: Retrying tests alone is not a viable flaky test mitigation strategy. This plugin should only be used alongside processes for tracking and fixing discovered flaky tests.

== Usage

Apply the plugin using one of the two methods described on the Gradle Plugin Portal, where the plugin is listed as `org.gradle.test-retry`. It is compatible with Gradle 5.0 and later versions.
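
As a minimal sketch, applying the plugin through the `plugins` block of a Groovy build script could look like the following. The `java` plugin line and the version string are assumptions for illustration: `<version>` is a placeholder, not a real release number, and should be replaced with the version listed on the Plugin Portal.

.build.gradle:
[source,groovy]
----
plugins {
    // assumed here so that a `test` task exists
    id 'java'
    // '<version>' is a placeholder, not an actual release number
    id 'org.gradle.test-retry' version '<version>'
}
----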

By default, retrying is not enabled.

Retrying is configured per test task via the `retry` extension added to each task by the plugin.

.build.gradle:
[source,groovy]
----
test {
    retry {
        maxRetries = 2
        maxFailures = 20
        failOnPassedAfterRetry = true
    }
}
----

.build.gradle.kts:
[source,kotlin]
----
tasks.test {
    retry {
        maxRetries.set(2)
        maxFailures.set(20)
        failOnPassedAfterRetry.set(true)
    }
}
----
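
Because the extension is added to each test task, a build with several test tasks may want one shared configuration. The following is a minimal sketch, assuming the standard Gradle `tasks.withType(Test).configureEach` idiom; the values are illustrative only.

.build.gradle:
[source,groovy]
----
// Applies the same retry settings to every task of type Test,
// not only to the default `test` task.
tasks.withType(Test).configureEach {
    retry {
        maxRetries = 2
        maxFailures = 20
    }
}
----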

=== Limiting retry to CI builds

You may find that local developer builds do not benefit much from retry behaviour, particularly when those tests are invoked via your IDE. In that case we recommend enabling retry only for CI builds.

.build.gradle:
[source,groovy]
----
boolean isCiServer = System.getenv().containsKey("CI")
test {
    retry {
        if (isCiServer) {
            maxRetries = 2
            maxFailures = 20
        }
        failOnPassedAfterRetry = true
    }
}
----
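
Not every CI system sets a `CI` environment variable. As an alternative sketch, the switch can be a Gradle project property passed on the command line. The property name `ci` is hypothetical and chosen only for this example.

.build.gradle:
[source,groovy]
----
// 'ci' is a hypothetical property name, passed as: ./gradlew test -Pci
boolean isCiServer = project.hasProperty('ci')
test {
    retry {
        // with 0 retries, failed tests are not retried
        maxRetries = isCiServer ? 2 : 0
        maxFailures = 20
    }
}
----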

== The retry extension

The retry extension is of the following type:


[source,java]
----
package org.gradle.testretry;

import org.gradle.api.Action;
import org.gradle.api.provider.Property;
import org.gradle.api.provider.SetProperty;
import org.gradle.api.tasks.testing.Test;
----



== Supported test frameworks

Other versions are likely to work as well, but are not tested.

[%header,cols=2*]
|===
|Framework
|Version Tested

|JUnit4
|4.13.2

|JUnit5
|5.9.2

|Spock
|2.3-groovy-3.0

|TestNG
|7.5
|===

=== Parameterized tests

In a few cases, test selection for testing frameworks limits the granularity at which tests can be retried. In each case, the plugin retries at method level at worst. For JUnit5 `@ParameterizedTest`, TestNG `@Test(dataProvider = "...")`, and Spock `@Unroll` tests, the plugin retries the entire method with all parameters, including those that initially passed.

=== Test dependencies

The plugin supports retrying Spock `@Stepwise` tests and TestNG `@Test(dependsOn = { … })` tests.

=== Custom test frameworks

Some projects may use test tasks with a custom `TestFramework` to execute tests. If this is the case, the plugin disables retries and emits the following warning:

----
> Task :unsupportedTestTaskUnitTest
Test retry requested for task :unsupportedTestTaskUnitTest with unsupported test framework CustomTestFramework - failing tests will not be retried
----

To avoid this warning, we can disable retries for the unsupported test task with:

.build.gradle:
[source,groovy]
----
tasks.named('unsupportedTestTaskUnitTest') {
    retry {
        maxRetries = 0
    }
}
----

.build.gradle.kts:
[source,kotlin]
----
tasks.named<Test>("unsupportedTestTaskUnitTest") {
    retry {
        maxRetries.set(0)
    }
}
----

== Filtering

By default, all tests are eligible for retrying. The `filter` component of the test retry extension can be used to control which tests should be retried and which should not.

The decision to retry a test is based on the test's reported class name, regardless of the name of the test case or method. The presence or absence of annotations on this class can also be used as a criterion.

.build.gradle:
[source,groovy]
----
test {
    retry {
        maxRetries = 3
        filter {
            // filter by qualified class name (* matches zero or more of any character)
            includeClasses.add("*IntegrationTest")
            excludeClasses.add("*DatabaseTest")

            // filter by class level annotations
            // Note: @Inherited annotations are respected
        }
    }
}
----
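
The annotation-based criteria described above can be sketched as follows. The property names `includeAnnotationClasses` and `excludeAnnotationClasses`, and the annotation name patterns, are assumptions not confirmed by this document; check them against the `retry` extension type of the plugin version in use.

.build.gradle:
[source,groovy]
----
test {
    retry {
        maxRetries = 3
        filter {
            // Assumed property names; `*Retryable` and `*NonRetryable`
            // are hypothetical annotation name patterns.
            includeAnnotationClasses.add("*Retryable")
            excludeAnnotationClasses.add("*NonRetryable")
        }
    }
}
----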


== Retry on class-level

By default, individual tests are retried. The `classRetry` component of the test retry extension can be used to control which test classes must be retried as a whole unit. Test classes still have to pass the configured filter.

.build.gradle:
[source,groovy]
----
test {
    retry {
        maxRetries = 3
        classRetry {
            // configure by qualified class name (* matches zero or more of any character)
            includeClasses.add("*StepWiseIntegrationTest")

            // configure by class level annotations
            // Note: @Inherited annotations are respected
        }
    }
}
----
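
Selecting classes for whole-class retry by annotation might look like the sketch below. The property name `includeAnnotationClasses` on `classRetry` and the `*ClassRetry` pattern are assumptions for illustration, not taken from this document.

.build.gradle:
[source,groovy]
----
test {
    retry {
        maxRetries = 3
        classRetry {
            // Assumed property name; `*ClassRetry` is a hypothetical
            // annotation name pattern.
            includeAnnotationClasses.add("*ClassRetry")
        }
    }
}
----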


== Reporting

=== Gradle

Each execution of a test is discretely reported in Gradle-generated XML and HTML reports.

image:gradle-reports-test-retry-reporting2.png[Gradle test reporting, align="center", title=Gradle HTML test report]

image:gradle-reports-test-retry-reporting.png[Gradle flaky test reporting, align="center", title=Flaky test reported in Gradle HTML test report]

Similar to the XML and HTML reports, the console log will also report each individual test execution. Before retrying a failed test, Gradle will execute the whole test suite of the test task. This means that all executions of the same test may not be grouped in the console log.
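
To see passed executions next to failed ones in the console, test logging can be widened. This is a minimal sketch, assuming Gradle's standard `testLogging` block on the test task; it is not specific to this plugin, and the retry values are illustrative.

.build.gradle:
[source,groovy]
----
test {
    retry {
        maxRetries = 2
    }
    testLogging {
        // log an event for each passed, skipped and failed test execution
        events "passed", "skipped", "failed"
    }
}
----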

image:gradle-console-test-retry-reporting.png[Gradle console reporting, align="center", title=Flaky test Gradle console output]

=== Develocity

Gradle build scans (`--scan` option) report discrete test executions as "Execution [N of total]" and mark a test with both a failed and then a passed outcome as flaky.

image:gradle-build-scan-test-retry-reporting.png[Gradle build scan reporting, align="center", title="Gradle build scan test report", caption="Build scan Tests view"]

Flaky tests can also be visualized across many builds using the Develocity Tests Dashboard.

image:gradle-enterprise-flaky-test-reporting.png[Develocity top tests report, align="center", title=Develocity top tests report]

=== IDEs

The plugin has been tested with IDEA, Eclipse IDE and Netbeans.

==== IDEA

When delegating test execution to Gradle, each execution is reported discretely, as in the test reports. Tests run without Gradle delegation are not retried.

image:idea-test-retry-reporting.png[IDEA test reporting, align="center", title=IDEA test retry reporting]

==== Eclipse

When delegating test execution to Gradle, each execution is reported discretely, as in the test reports. Tests run without Gradle delegation are not retried.

image:eclipse-test-retry-reporting.png[Eclipse test reporting, align="center", title=Eclipse test retry reporting]

==== Netbeans

Netbeans only shows the last execution of a test.

image:netbeans-test-retry-reporting.png[Netbeans test reporting, align="center", title=Netbeans test retry reporting]

=== CI tools

The plugin has been tested with the reporting of TeamCity and Jenkins.

==== TeamCity

Flaky tests (tests being executed multiple times but with different results) are detected by TeamCity and marked as flaky. TeamCity lists each test that was executed and how often it was run in the build.

By default, TeamCity will fail your build if at least one test fails. When using `failOnPassedAfterRetry = false` (i.e. the default for this plugin), this failure condition should be disabled.

image:teamcity-test-retry-reporting.png[TeamCity test reporting, align="center", title=TeamCity test retry reporting including flaky test detection]

==== Jenkins

Jenkins reports each test execution discretely.

image:jenkins-test-retry-reporting.png[Jenkins test reporting, align="center", title=Jenkins test retry reporting]