ray-project / ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
https://ray.io
Apache License 2.0
34.18k stars 5.8k forks source link

CI test linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu is flaky #47309

Open can-anyscale opened 3 months ago

can-anyscale commented 3 months ago

CI test linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu is consistently_failing. Recent failures:

DataCaseName-linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu-END Managed by OSS Test Policy

can-anyscale commented 3 months ago

This test is now considered as flaky because it has been failing on postmerge for too long. Flaky tests do not run on premerge.

can-anyscale commented 3 months ago

Blamed commit: fd84b9d631864f5dfca6c4bb6806a7e9c5bc1126 found by bisect job https://buildkite.com/ray-project/release-tests-bisect/builds/1484

can-anyscale commented 3 months ago

Test passed on latest run: https://buildkite.com/ray-project/postmerge/builds/6001#01918219-ea19-447c-bf36-e0cdec7c66d0

can-anyscale commented 3 months ago

CI test linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu is flaky. Recent failures:

DataCaseName-linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu-END Managed by OSS Test Policy

can-anyscale commented 3 months ago

Test passed on latest run: https://buildkite.com/ray-project/postmerge/builds/6048#01919bd1-4a12-4271-b06d-aae1599dc7bc

can-anyscale commented 2 months ago

CI test linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu is consistently_failing. Recent failures:

DataCaseName-linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu-END Managed by OSS Test Policy

can-anyscale commented 2 months ago

Test passed on latest run: https://buildkite.com/ray-project/postmerge/builds/6071#0191a1a0-6cc8-4908-a3e1-22d8011d28e0

can-anyscale commented 2 months ago

CI test linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu is consistently_failing. Recent failures:

DataCaseName-linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu-END Managed by OSS Test Policy

can-anyscale commented 2 months ago

Test passed on latest run: https://buildkite.com/ray-project/postmerge/builds/6079#0191a1ad-7d55-4ae5-871f-cef93fc43b6b

can-anyscale commented 2 months ago

CI test linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu is flaky. Recent failures:

DataCaseName-linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu-END Managed by OSS Test Policy

can-anyscale commented 2 months ago

Test passed on latest run: https://buildkite.com/ray-project/postmerge/builds/6134#0191bf82-5d7a-4fe8-bddc-1e900e1a00f6

can-anyscale commented 2 months ago

Test passed on latest run: https://buildkite.com/ray-project/postmerge/builds/6135#0191c092-2264-40da-99b2-9cf16e074a46

can-anyscale commented 2 months ago

CI test linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu is flaky. Recent failures:

DataCaseName-linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu-END Managed by OSS Test Policy

can-anyscale commented 2 months ago

Test passed on latest run: https://buildkite.com/ray-project/postmerge/builds/6167#0191caa5-4d45-4ada-b123-123d47ba3098

can-anyscale commented 2 months ago

CI test linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu is flaky. Recent failures:

DataCaseName-linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu-END Managed by OSS Test Policy

can-anyscale commented 2 months ago

Test passed on latest run: https://buildkite.com/ray-project/postmerge/builds/6314#01920277-0f4d-4ac5-a6c2-2b61559afbfe

can-anyscale commented 2 months ago

Test passed on latest run: https://buildkite.com/ray-project/postmerge/builds/6321#01920388-94d0-4586-8087-cba83b5c22b5

can-anyscale commented 2 months ago

CI test linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu is consistently_failing. Recent failures:

DataCaseName-linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu-END Managed by OSS Test Policy

can-anyscale commented 2 months ago

Test passed on latest run: https://buildkite.com/ray-project/postmerge/builds/6350#01921109-efe1-4258-9e08-2904993b2d3f

can-anyscale commented 1 month ago

CI test linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu is consistently_failing. Recent failures:

DataCaseName-linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu-END Managed by OSS Test Policy

can-anyscale commented 1 month ago

Blamed commit: 289374af7fe2502594c83e5ed528dfb6e8d531fc found by bisect job https://buildkite.com/ray-project/release-tests-bisect/builds/1598

can-anyscale commented 1 month ago

Test passed on latest run: https://buildkite.com/ray-project/postmerge/builds/6441#01924679-9e6f-4e58-96f5-da3d0cca4c96

can-anyscale commented 1 month ago

CI test linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu is consistently_failing. Recent failures:

DataCaseName-linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu-END Managed by OSS Test Policy

can-anyscale commented 1 month ago

Test passed on latest run: https://buildkite.com/ray-project/postmerge/builds/6448#01924f68-086b-4ec1-b6b4-7a7b02f4512f

can-anyscale commented 1 month ago

CI test linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu is consistently_failing. Recent failures:

DataCaseName-linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu-END Managed by OSS Test Policy

can-anyscale commented 1 month ago

CI test linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu is consistently_failing. Recent failures:

DataCaseName-linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu-END Managed by OSS Test Policy

can-anyscale commented 1 month ago

Blamed commit: cbde03cf8c42d4d817f81fdb506ca7378f162baa found by bisect job https://buildkite.com/ray-project/release-tests-bisect/builds/1609

can-anyscale commented 1 month ago

Test passed on latest run: https://buildkite.com/ray-project/postmerge/builds/6470#01925b13-547d-460f-9706-424a5e9551fb

can-anyscale commented 1 month ago

CI test linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu is flaky. Recent failures:

DataCaseName-linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu-END Managed by OSS Test Policy

can-anyscale commented 1 month ago

Test passed on latest run: https://buildkite.com/ray-project/postmerge/builds/6502#0192720b-4c41-49e5-8d44-eac1d5e3f814

can-anyscale commented 1 month ago

CI test linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu is consistently_failing. Recent failures:

DataCaseName-linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu-END Managed by OSS Test Policy

can-anyscale commented 1 month ago

Blamed commit: d5fa9a04ed841ea845887f43e06a0d2a81216c2d found by bisect job https://buildkite.com/ray-project/release-tests-bisect/builds/1629

can-anyscale commented 1 month ago

Test passed on latest run: https://buildkite.com/ray-project/postmerge/builds/6584#01929b3d-4236-4790-a43e-f34c2f5a0ed7

can-anyscale commented 1 month ago

CI test linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu is consistently_failing. Recent failures:

DataCaseName-linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu-END Managed by OSS Test Policy

can-anyscale commented 1 month ago

CI test linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu is consistently_failing. Recent failures:

DataCaseName-linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu-END Managed by OSS Test Policy

can-anyscale commented 1 month ago

Test passed on latest run: https://buildkite.com/ray-project/postmerge/builds/6586#01929d7e-0e0e-4587-a624-0746e607a136

can-anyscale commented 1 month ago

CI test linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu is flaky. Recent failures:

DataCaseName-linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu-END Managed by OSS Test Policy

can-anyscale commented 1 month ago

Test passed on latest run: https://buildkite.com/ray-project/postmerge/builds/6590#0192a11e-0850-432b-94a9-7f9e26e4eb2a

can-anyscale commented 1 month ago

CI test linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu is consistently_failing. Recent failures:

DataCaseName-linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu-END Managed by OSS Test Policy

can-anyscale commented 1 month ago

Blamed commit: f860b74c96acde8612702be3d4a8cb330ab59786 found by bisect job https://buildkite.com/ray-project/release-tests-bisect/builds/1633

can-anyscale commented 1 month ago

This test is now considered as flaky because it has been failing on postmerge for too long. Flaky tests do not run on premerge.

can-anyscale commented 2 weeks ago

Test passed on latest run: https://buildkite.com/ray-project/postmerge/builds/6922#0193227b-5aea-4c84-b22d-3bc9da072724

can-anyscale commented 1 week ago

CI test linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu is consistently_failing. Recent failures:

DataCaseName-linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu-END Managed by OSS Test Policy

can-anyscale commented 1 week ago

Test passed on latest run: https://buildkite.com/ray-project/postmerge/builds/6954#01932836-4096-4cdf-8b78-136b298fe917

can-anyscale commented 1 week ago

CI test linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu is flaky. Recent failures:

DataCaseName-linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu-END Managed by OSS Test Policy

can-anyscale commented 1 week ago

Test passed on latest run: https://buildkite.com/ray-project/postmerge/builds/6986#019341f4-6c60-4347-8300-a41954d51f37

can-anyscale commented 1 week ago

CI test linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu is consistently_failing. Recent failures:

DataCaseName-linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu-END Managed by OSS Test Policy

can-anyscale commented 1 week ago

Test passed on latest run: https://buildkite.com/ray-project/postmerge/builds/6995#019345a4-841a-4593-9cc1-1dc156bc7ec4

can-anyscale commented 1 week ago

CI test linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu is flaky. Recent failures:

DataCaseName-linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu-END Managed by OSS Test Policy