ocurrent / opam-repo-ci

An OCurrent pipeline for testing submissions to opam-repository
Apache License 2.0

Improve fault tolerance of CI jobs #391

Open shonfeder opened 1 week ago

shonfeder commented 1 week ago

It's fairly common for jobs to fail with flaky errors (e.g., because of network issues). To improve the reliability of the CI results, we want to make the operations that are prone to these transient faults more tolerant, for example by adding retries.

Please add any flaky errors you encounter to this issue.
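
As a rough illustration of the kind of retry meant here, below is a minimal sketch in Python of retrying an operation with exponential backoff. It is not taken from opam-repo-ci: the `retry` helper, its parameters, and the choice of `OSError` as the retryable error class are all assumptions made for the example.

```python
import time
from typing import Callable, Tuple, Type, TypeVar

T = TypeVar("T")


def retry(
    operation: Callable[[], T],
    attempts: int = 3,
    base_delay: float = 1.0,
    retry_on: Tuple[Type[BaseException], ...] = (OSError,),
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Run `operation`, retrying on the given exception types.

    Hypothetical helper: waits base_delay, 2*base_delay, 4*base_delay, ...
    seconds between attempts and re-raises the last error once the
    attempt budget is used up.
    """
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    for attempt in range(1, attempts + 1):
        try:
            return operation()
        except retry_on:
            if attempt == attempts:
                # Out of attempts: surface the original failure.
                raise
            # Exponential backoff before the next attempt.
            sleep(base_delay * 2 ** (attempt - 1))
    raise AssertionError("unreachable")
```

Errors outside `retry_on` propagate immediately, so a genuine failure (for example a bad package definition) would not be retried. The `sleep` parameter exists only so the delay can be replaced in tests.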


shonfeder commented 6 days ago

From https://github.com/ocaml/opam-repository/pull/26916#issuecomment-2488225735

2024-11-20 09:24.38: Got resource from pool OCluster
Building on m1-worker-03
error: RPC failed; curl 92 HTTP/2 stream 7 was not closed cleanly: CANCEL (err 8)
error: 10845 bytes of body are still expected
fetch-pack: unexpected disconnect while reading sideband packet
fatal: early EOF
fatal: fetch-pack: invalid index-pack output
git-fetch failed with exit-code 128
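
One possible way to decide whether a failure like this is worth retrying is to match the job's error output against known transient signatures. The Python sketch below assumes the messages in the log above are representative; `TRANSIENT_PATTERNS` and `looks_transient` are hypothetical names for illustration and are not part of opam-repo-ci.

```python
import re

# Patterns copied from the failed git-fetch log above; a real list would
# grow as more flaky errors are reported on this issue.
TRANSIENT_PATTERNS = [
    r"RPC failed; curl \d+",
    r"unexpected disconnect while reading sideband packet",
    r"early EOF",
]


def looks_transient(stderr: str) -> bool:
    """Return True if the error output matches a known transient signature."""
    return any(re.search(pattern, stderr) for pattern in TRANSIENT_PATTERNS)
```

A job runner could call `looks_transient` on the captured output and only retry when it returns `True`, so that deterministic failures still fail fast.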