ray-project / ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
https://ray.io
Apache License 2.0
33.28k stars 5.63k forks source link

CI test linux://rllib:TestLearnerGroupAsyncUpdate is flaky #45088

Open can-anyscale opened 5 months ago

can-anyscale commented 5 months ago

CI test linux://rllib:TestLearnerGroupAsyncUpdate is consistently_failing. Recent failures:

DataCaseName-linux://rllib:TestLearnerGroupAsyncUpdate-END Managed by OSS Test Policy

can-anyscale commented 5 months ago

i think it's https://github.com/ray-project/ray/pull/44995/files

can-anyscale commented 5 months ago

This test is now considered as flaky because it has been failing on postmerge for too long. Flaky tests do not run on premerge.

can-anyscale commented 5 months ago

fixed by https://github.com/ray-project/ray/pull/45110

can-anyscale commented 5 months ago

Test passed on latest run: https://buildkite.com/ray-project/postmerge/builds/4303#018f3b13-9580-4bb4-9e49-1a5474d115fe

can-anyscale commented 3 months ago

CI test linux://rllib:TestLearnerGroupAsyncUpdate is consistently_failing. Recent failures:

DataCaseName-linux://rllib:TestLearnerGroupAsyncUpdate-END Managed by OSS Test Policy

can-anyscale commented 3 months ago

Test passed on latest run: https://buildkite.com/ray-project/postmerge/builds/4760#018feefd-3476-4a3b-bde6-64f2786d3c9c

can-anyscale commented 3 months ago

Blamed commit: 937a8fd7935efa526c827cf7997ae46c483bc94e found by bisect job https://buildkite.com/ray-project/release-tests-bisect/builds/1214

can-anyscale commented 3 months ago

Test passed on latest run: https://buildkite.com/ray-project/postmerge/builds/4768#018ff00b-2137-438c-b0c9-77783fa3a5c4

can-anyscale commented 3 months ago

CI test linux://rllib:TestLearnerGroupAsyncUpdate is consistently_failing. Recent failures:

DataCaseName-linux://rllib:TestLearnerGroupAsyncUpdate-END Managed by OSS Test Policy

can-anyscale commented 3 months ago

Test passed on latest run: https://buildkite.com/ray-project/postmerge/builds/4970#01901fad-9ad9-478b-a03b-c933845043cf

can-anyscale commented 3 months ago

Blamed commit: 70e5e78d7a50f1f6b2b1b8b0474df50a03056331 found by bisect job https://buildkite.com/ray-project/release-tests-bisect/builds/1242

can-anyscale commented 3 months ago

Test passed on latest run: https://buildkite.com/ray-project/postmerge/builds/4981#01902498-339a-4524-9bb8-18ab6155a947

can-anyscale commented 3 months ago

CI test linux://rllib:TestLearnerGroupAsyncUpdate is consistently_failing. Recent failures:

DataCaseName-linux://rllib:TestLearnerGroupAsyncUpdate-END Managed by OSS Test Policy

can-anyscale commented 3 months ago

Test passed on latest run: https://buildkite.com/ray-project/postmerge/builds/5053#01903be7-ff23-4db5-9c90-dc1be74aa4d5

can-anyscale commented 3 months ago

CI test linux://rllib:TestLearnerGroupAsyncUpdate is consistently_failing. Recent failures:

DataCaseName-linux://rllib:TestLearnerGroupAsyncUpdate-END Managed by OSS Test Policy

can-anyscale commented 3 months ago

Blamed commit: c942d6038ac9d209cbca6d6c8fbca7ca218f8c5f found by bisect job https://buildkite.com/ray-project/release-tests-bisect/builds/1270

can-anyscale commented 3 months ago

Test passed on latest run: https://buildkite.com/ray-project/postmerge/builds/5091#019048f7-fdc8-42bc-9058-668f7f11f957

can-anyscale commented 3 months ago

CI test linux://rllib:TestLearnerGroupAsyncUpdate is consistently_failing. Recent failures:

DataCaseName-linux://rllib:TestLearnerGroupAsyncUpdate-END Managed by OSS Test Policy

can-anyscale commented 3 months ago

Test passed on latest run: https://buildkite.com/ray-project/postmerge/builds/5129#019055aa-257e-4458-8fd4-a68c2862347d

can-anyscale commented 3 months ago

Blamed commit: e9109e673bae59dacd08514d71aca826c42443d2 found by bisect job https://buildkite.com/ray-project/release-tests-bisect/builds/1278

can-anyscale commented 3 months ago

CI test linux://rllib:TestLearnerGroupAsyncUpdate is consistently_failing. Recent failures:

DataCaseName-linux://rllib:TestLearnerGroupAsyncUpdate-END Managed by OSS Test Policy

can-anyscale commented 3 months ago

Test passed on latest run: https://buildkite.com/ray-project/postmerge/builds/5225#0190750f-4300-4665-8f20-b54173f44fad

can-anyscale commented 1 month ago

CI test linux://rllib:TestLearnerGroupAsyncUpdate is consistently_failing. Recent failures:

DataCaseName-linux://rllib:TestLearnerGroupAsyncUpdate-END Managed by OSS Test Policy

can-anyscale commented 1 month ago

This test is now considered as flaky because it has been failing on postmerge for too long. Flaky tests do not run on premerge.

can-anyscale commented 2 days ago

Test passed on latest run: https://buildkite.com/ray-project/postmerge/builds/6432#01924154-86cf-4745-8484-5cbae8ab2d5d

can-anyscale commented 1 day ago

CI test linux://rllib:TestLearnerGroupAsyncUpdate is flaky. Recent failures:

DataCaseName-linux://rllib:TestLearnerGroupAsyncUpdate-END Managed by OSS Test Policy

can-anyscale commented 19 hours ago

Test passed on latest run: https://buildkite.com/ray-project/postmerge/builds/6444#019249b2-2274-4f1f-851b-bf79391b3820

can-anyscale commented 14 hours ago

CI test linux://rllib:TestLearnerGroupAsyncUpdate is flaky. Recent failures:

DataCaseName-linux://rllib:TestLearnerGroupAsyncUpdate-END Managed by OSS Test Policy

can-anyscale commented 12 hours ago

CI test linux://rllib:TestLearnerGroupAsyncUpdate is flaky. Recent failures:

DataCaseName-linux://rllib:TestLearnerGroupAsyncUpdate-END Managed by OSS Test Policy