Open jcscottiii opened 1 year ago
I cannot submit several of my PRs because of this blocker. cc @foolip @jgraham (who force-merged previously) could you help investigate this?
Example PR: https://github.com/web-platform-tests/wpt/pull/40470 Logs: https://github.com/web-platform-tests/wpt/pull/40470/checks?check_run_id=14136304439
Another example PR: https://github.com/web-platform-tests/wpt/pull/40421 Logs: https://github.com/web-platform-tests/wpt/pull/40421/checks?check_run_id=14132619502
The failures are all related to webdriver classic, which are all unrelated to both PRs. You can confirm this by grepping for "FAIL" in the logs: all failures are pertaining /webdriver/tests/classic
.
Actually, the error messages ask to tag a group instead of individuals, so let's do that:
These may be pre-existing or new flakes. Please try to reproduce (see the above WPT command, though some flags may not be needed when running locally) and determine if your change introduced the flake. If you are unable to reproduce the problem, please tag @web-platform-tests/wpt-core-team in a comment for help.
These may be pre-existing or newly slow tests. Slow tests indicate that a test ran very close to the test timeout limit and so may become TIMEOUT-flaky in the future. Consider speeding up the test or breaking it into multiple tests. For help, please tag @web-platform-tests/wpt-core-team in a comment.
cc @web-platform-tests/wpt-core-team
I believe I understand the pattern. Whenever files in /webdriver/tests/support are touched, the CI fails because of the current pre-existing failures.
Now with https://github.com/web-platform-tests/rfcs/pull/131 merged what does it mean for those jobs? Do those changes have to be applied now?
Hey @whimboo! The next steps would be to implement that RFC. We are currently prioritizing our work. Once we have a timeline for that work we will comment on that RFC.
@nechaev-chromium I believe you may have fixed some of those issues, with https://chromium-review.googlesource.com/c/chromium/src/+/4675633
I fixed some of those with https://github.com/web-platform-tests/wpt/pull/40887
I was out for 3 weeks. @thiagowfx are those jobs are more stable nowadays? In case they still fail often what else might be left to do? It's at least good to see that this crash has been fixed!
Splitting the tests was overall helpful. They are more stable, but not completely. https://github.com/web-platform-tests/wpt/issues/41083 also needs to be fixed.
We have fixed two causes of ConnectionRefusedError. The fixes must be available since 117.0.5915.x
Is there any work left to do? Recently it looks pretty good around this job. Through I'm not sure how often it still fails for PRs and landings that I don't watch.
The question we should ask ourselves is: Do we still require admin merges to bypass this? If yes, then there's still work to do. I haven't merged any non-trivial PRs recently, @Lightning00Blade @OrKoN what's your experience in the last few weeks?
FYI we have had a quite good experience lately with admin merge requests when specifically asking the web-platform-tests/admins
team directly. Some person should always be around for help.
Note that #40990 was fixed, so the wpt-chrome-dev-stability check should no longer block any PR.
Note that there is also a bug in Chrome which causes an extra 100ms delay when trying to resize or re-position a window. With https://github.com/web-platform-tests/wpt/pull/43853 I'm going to add a workaround until it's fixed.
With this PR landed the chrome wdspec tests will drastically speed-up.
Background
This PR was stuck for awhile because there was a failure on the wpt-chrome-dev-stability GitHub action. The PR changed some webdriver test files.
The errors seen in the PR include:
Logs 1 (click to expand/collapse)
``` ERROR test_no_top_browsing_context - setup error: webdriver.error.UnknownErrorException: unknown error (500): unknown error: Chrome failed to start: crashed. ```Logs 2 (click to expand/collapse)
``` 79:42.61 INFO STDOUT: E webdriver.error.WebDriverException: tab crashed (500): tab crashed 79:42.61 INFO STDOUT: E (Session info: chrome=111.0.5562.0) 79:42.61 INFO STDOUT: E ```Logs 3 (click to expand/collapse)
``` 1:14.81 TEST_END: Test OK. Subtests passed 10/11. Unexpected 1 FAIL test_cross_origin[capabilities0] - webdriver.error.StaleElementReferenceException: stale element reference (404): stale element reference: element is not attached to the page document session =Logs 4 (click to expand/collapse)
``` setup error: webdriver.error.UnknownErrorException: unknown error (500): unknown error: Chrome failed to start: crashed ```To prove that it was unrelated, we created a PR that only touched the whitespace in the same files. From there, we could conclude that it was safe to merge since the same errors came up.
Risk of not resolving
Initial Hypotheses