GH actions long delay between finishing build job and starting success job

n2ygk commented 9 months ago

Describe the bug

In watching multiple PRs after I've approved them, it appears to take a long time for the success job to start after the last step of the build job has finished. See #1219 where the separate success job was added to make it easier to update the matrix and only ever depend on build to finish for tests to succeed.

To Reproduce

Cause a PR to run tests.

Expected behavior

I didn't expect anything but was hoping that the wait for the success step wouldn't happen.

Version

current master branch

[ ] I have tested with the latest published release and it's still a problem.
[x] I have tested with the master branch and it's still a problem.

Additional context

@dopry I'm guessing that GH is allocating a runner(s) for each job, so after the build job finishes, we wait for another runner to become available for the success job. This takes a while. See below with timestamps selected. So I am guessing that running a second job that depends on the first has to wait for a new runner to become available. Sometimes correlation is indicative of causation.

Mon, 18 Dec 2023 17:59:18 GMT last matrix step of build job finished Mon, 18 Dec 2023 18:31:45 GMT success job starts

While watching the PR, the success job status is waiting on a runner. Here's some raw log showing the 30 minute wait for a runner:

2023-12-18T17:59:50.5029369Z Requested labels: ubuntu-latest
2023-12-18T17:59:50.5029714Z Job defined at: jazzband/django-oauth-toolkit/.github/workflows/test.yml@refs/heads/pre-commit-ci-update-config
2023-12-18T17:59:50.5029846Z Waiting for a runner to pick up this job...
2023-12-18T18:31:40.3399714Z Job is waiting for a hosted runner to come online.
2023-12-18T18:31:42.6767234Z Job is about to start running on the hosted runner: GitHub Actions 7 (hosted)
...