pytorch / test-infra

This repository hosts code that supports the testing infrastructure for the main PyTorch repo. For example, this repo hosts the logic to track disabled tests and slow tests, as well as our continuation integration jobs HUD/dashboard.
https://hud.pytorch.org/
Other
73 stars 69 forks source link

Wrong Dr CI classification: unstable job causing real failures to be marked unstable #5402

Open clee2000 opened 3 weeks ago

clee2000 commented 3 weeks ago

The test_overrides failure is real. If you click on the related jobs for the jobs labeled unstable, they all link back to the experimental split build unstable jobs for the commit. I believe it is due to https://github.com/pytorch/test-infra/pull/5308

:link: Helpful Links

:test_tube: See artifacts and rendered test results at hud.pytorch.org/pr/130047

Note: Links to docs will display an error until the docs builds have been completed.

:x: 2 New Failures, 1 Cancelled Job, 4 Unrelated Failures

As of commit b5f29ed3395eb6d1422f02f74af94dbf1b22362d with merge base 042d764872a6307ed2f087cf8a741b2106370207 (image):

NEW FAILURES - The following jobs have failed:

* [trunk / linux-focal-rocm6.1-py3.8 / test (default, 2, 2, linux.rocm.gpu)](https://hud.pytorch.org/pr/pytorch/pytorch/130047#27017216875) ([gh](https://github.com/pytorch/pytorch/actions/runs/9784693788/job/27017216875)) `inductor/test_torchinductor.py::SweepInputsCpuTest::test_cpu_broadcast1_dense` * [trunk / macos-py3-arm64 / test (default, 1, 3, macos-m1-stable)](https://hud.pytorch.org/pr/pytorch/pytorch/130047#27016377716) ([gh](https://github.com/pytorch/pytorch/actions/runs/9784693788/job/27016377716)) `test_overrides.py::TestTorchFunctionWarning::test_warn_on_invalid_torch_function`

CANCELLED JOB - The following job was cancelled. Please retry:

* [Check Labels](https://hud.pytorch.org/pr/pytorch/pytorch/130047#9812492686) ([gh](https://github.com/pytorch/pytorch/actions/runs/9812492686))

UNSTABLE - The following jobs failed but were likely due to flakiness present on trunk and has been marked as unstable:

* [pull / linux-focal-py3.11-clang10 / test (dynamo, 1, 3, linux.2xlarge)](https://hud.pytorch.org/pr/pytorch/pytorch/130047#27012281656) ([gh](https://github.com/pytorch/pytorch/actions/runs/9783316294/job/27012281656)) ([related job](https://hud.pytorch.org/pytorch/pytorch/commit/b5f29ed3395eb6d1422f02f74af94dbf1b22362d#27012283446)) `test_overrides.py::TestTorchFunctionWarning::test_warn_on_invalid_torch_function` * [pull / linux-focal-py3.12-clang10 / test (dynamo, 2, 3, linux.2xlarge)](https://hud.pytorch.org/pr/pytorch/pytorch/130047#27012276897) ([gh](https://github.com/pytorch/pytorch/actions/runs/9783316294/job/27012276897)) ([related job](https://hud.pytorch.org/pytorch/pytorch/commit/b5f29ed3395eb6d1422f02f74af94dbf1b22362d#27012283446)) `test_overrides.py::TestTorchFunctionWarning::test_warn_on_invalid_torch_function` * [pull / linux-focal-py3.12-clang10-experimental-split-build / test (dynamo, 1, 3, linux.2xlarge, unstable)](https://hud.pytorch.org/pr/pytorch/pytorch/130047#27012283446) ([gh](https://github.com/pytorch/pytorch/actions/runs/9783316294/job/27012283446)) ([#129256](https://hud.pytorch.org/pytorch/pytorch/issues/129256)) `test_overrides.py::TestTorchFunctionWarning::test_warn_on_invalid_torch_function` * [pull / linux-focal-py3.8-clang10 / test (dynamo, 2, 3, linux.2xlarge)](https://hud.pytorch.org/pr/pytorch/pytorch/130047#27012309913) ([gh](https://github.com/pytorch/pytorch/actions/runs/9783316294/job/27012309913)) ([related job](https://hud.pytorch.org/pytorch/pytorch/commit/b5f29ed3395eb6d1422f02f74af94dbf1b22362d#27012283446)) `test_overrides.py::TestTorchFunctionWarning::test_warn_on_invalid_torch_function`

This comment was automatically generated by Dr. CI and updates every 15 minutes.

clee2000 commented 3 weeks ago

Another incident here

:link: Helpful Links

:test_tube: See artifacts and rendered test results at hud.pytorch.org/pr/129185

Note: Links to docs will display an error until the docs builds have been completed.

:white_check_mark: You can merge normally! (2 Unrelated Failures)

As of commit faa3c8c78ad2626d8c927ab0e38352ea496f7f05 with merge base e5841bb8d5aa1f413cdec1c904ed9b68b91ea356 (image):

UNSTABLE - The following jobs failed but were likely due to flakiness present on trunk and has been marked as unstable:

* [pull / linux-focal-py3.12-clang10 / test (dynamo, 3, 3, linux.2xlarge)](https://hud.pytorch.org/pr/pytorch/pytorch/129185#27116874928) ([gh](https://github.com/pytorch/pytorch/actions/runs/9814845603/job/27116874928)) ([related job](https://hud.pytorch.org/pytorch/pytorch/commit/faa3c8c78ad2626d8c927ab0e38352ea496f7f05#27116867888)) `test_linalg.py::TestLinalgCPU::test_lobpcg_torchscript_cpu_float64` * [pull / linux-focal-py3.12-clang10-experimental-split-build / test (dynamo, 3, 3, linux.2xlarge, unstable)](https://hud.pytorch.org/pr/pytorch/pytorch/129185#27116867888) ([gh](https://github.com/pytorch/pytorch/actions/runs/9814845603/job/27116867888)) `test_linalg.py::TestLinalgCPU::test_lobpcg_torchscript_cpu_float64`

This comment was automatically generated by Dr. CI and updates every 15 minutes.