ray-project / ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
https://ray.io
Apache License 2.0
33.05k stars 5.59k forks source link

CI test linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu is flaky #47309

Open can-anyscale opened 3 weeks ago

can-anyscale commented 3 weeks ago

CI test linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu is consistently_failing. Recent failures:

DataCaseName-linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu-END Managed by OSS Test Policy

can-anyscale commented 3 weeks ago

This test is now considered as flaky because it has been failing on postmerge for too long. Flaky tests do not run on premerge.

can-anyscale commented 3 weeks ago

Blamed commit: fd84b9d631864f5dfca6c4bb6806a7e9c5bc1126 found by bisect job https://buildkite.com/ray-project/release-tests-bisect/builds/1484

can-anyscale commented 3 weeks ago

Test passed on latest run: https://buildkite.com/ray-project/postmerge/builds/6001#01918219-ea19-447c-bf36-e0cdec7c66d0

can-anyscale commented 3 weeks ago

CI test linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu is flaky. Recent failures:

DataCaseName-linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu-END Managed by OSS Test Policy

can-anyscale commented 2 weeks ago

Test passed on latest run: https://buildkite.com/ray-project/postmerge/builds/6048#01919bd1-4a12-4271-b06d-aae1599dc7bc

can-anyscale commented 2 weeks ago

CI test linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu is consistently_failing. Recent failures:

DataCaseName-linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu-END Managed by OSS Test Policy

can-anyscale commented 2 weeks ago

Test passed on latest run: https://buildkite.com/ray-project/postmerge/builds/6071#0191a1a0-6cc8-4908-a3e1-22d8011d28e0

can-anyscale commented 2 weeks ago

CI test linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu is consistently_failing. Recent failures:

DataCaseName-linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu-END Managed by OSS Test Policy

can-anyscale commented 2 weeks ago

Test passed on latest run: https://buildkite.com/ray-project/postmerge/builds/6079#0191a1ad-7d55-4ae5-871f-cef93fc43b6b

can-anyscale commented 2 weeks ago

CI test linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu is flaky. Recent failures:

DataCaseName-linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu-END Managed by OSS Test Policy

can-anyscale commented 1 week ago

Test passed on latest run: https://buildkite.com/ray-project/postmerge/builds/6134#0191bf82-5d7a-4fe8-bddc-1e900e1a00f6

can-anyscale commented 1 week ago

Test passed on latest run: https://buildkite.com/ray-project/postmerge/builds/6135#0191c092-2264-40da-99b2-9cf16e074a46

can-anyscale commented 1 week ago

CI test linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu is flaky. Recent failures:

DataCaseName-linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu-END Managed by OSS Test Policy

can-anyscale commented 1 week ago

Test passed on latest run: https://buildkite.com/ray-project/postmerge/builds/6167#0191caa5-4d45-4ada-b123-123d47ba3098

can-anyscale commented 6 days ago

CI test linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu is flaky. Recent failures:

DataCaseName-linux://rllib:learning_tests_multi_agent_pendulum_sac_multi_gpu-END Managed by OSS Test Policy