pre-commit-ci / issues

public issues for https://pre-commit.ci
17 stars 3 forks source link

pre-commit.ci timing out when passes locally #62

Closed matthewfeickert closed 3 years ago

matthewfeickert commented 3 years ago

:wave: Hi. pre-commit.ci is failing by timeout for PR https://github.com/scikit-hep/pyhf/pull/1403 when it passes locally in a fresh virtual environment (and also passes in pre-commit.ci but then timesout).

c.f. https://results.pre-commit.ci/repo/github/118789569

pre-commit_failure

and for a particular failing run

run_failure

This is probably just transitory issue, but thought I'd still report it.

cc @lukasheinrich @kratsg


Also example of my claim that pre-commit passes locally:


(base) $ git checkout feat/clean-public-api-all  # Branch for the PR that is failing
(base) $ pyenv virtualenv 3.8.7 test-pre-commit
(base) $ pyenv activate test-pre-commit 
(test-pre-commit) $ pip install --upgrade pip setuptools wheel
(test-pre-commit) $ pip install pre-commit
(test-pre-commit) $ pre-commit run --all-files
Check for added large files..............................................Passed
Check for case conflicts.................................................Passed
Check for merge conflicts................................................Passed
Check for broken symlinks................................................Passed
Check JSON...............................................................Passed
Check Yaml...............................................................Passed
Check Toml...............................................................Passed
Check Xml................................................................Passed
Debug Statements (Python)................................................Passed
Fix End of Files.........................................................Passed
Mixed line ending........................................................Passed
Fix requirements.txt.................................(no files to check)Skipped
Trim Trailing Whitespace.................................................Passed
black....................................................................Passed
blacken-docs.............................................................Passed
flake8...................................................................Passed
pyupgrade................................................................Passed
nbqa-black...............................................................Passed
nbqa-pyupgrade...........................................................Passed
asottile commented 3 years ago

very strange, timings do look elevated today according to my metrics -- let me look into whether something changed

matthewfeickert commented 3 years ago

I doubt this matters much, but the 3 shown timeouts above are happening at different stages:

matthewfeickert commented 3 years ago

@asottile retriggering the run from the GitHub comments has things passing now (after a long queue time): https://results.pre-commit.ci/run/github/118789569/1619557729.aLz8qBFvTBiIxfXakvechw

asottile commented 3 years ago

yeah the queue makes sense, I was kicking off a bunch of runs at the same time while the hosts were cycling.

there were no code changes during the period that led to higher timeouts, I suspect one of the hosts got a noisy neighbor in aws:

image

I'll be putting in some automated alerts to catch this particular failure mode in the future -- thanks for the report!

I'm going to send a message to the mailing list to make sure others know about this and follow up with a postmortem once I'm comfortable that it is resolved

I'll be watching this closely over the next couple of hours to make sure that fixed it

I'll also be sending out a postmortem entry to the mailing list

matthewfeickert commented 3 years ago

Awesome. :) Many thanks for this report and also for being :zap: fast in your feedback and help!

asottile commented 3 years ago

marking this all clear, run times have returned to normal after mitigation


postmortem

root cause

unknown

what went well

what didn't go well

follow-up

Borda commented 3 years ago

Hello, just curious is there is a way how to extend the timeout? https://results.pre-commit.ci/run/github/145693916/1622449847.gyQnW8ktQPCcmngXvQmabA