sourcegraph / sourcegraph-public-snapshot

Code AI platform with Code Search & Cody
https://sourcegraph.com
Other
10.11k stars 1.29k forks source link

ci: Docker In Docker sidecar dying will led to a failed build and not a canceled one. #26080

Open jhchabran opened 3 years ago

jhchabran commented 3 years ago

This failed because of docker daemon going down:

The big problem here is that the build is a failure, when in reality it is being canceled. This will mess with observability if unaddressed.

Big thanks to @mrnugget for taking the time to jump and pair on his failed build to have a closer look.

jhchabran commented 3 years ago

Made some tests, anything happening to an agent (killed, lost, whatever) is being displayed as a failure, unless ... you start looking at the exit code at the very bottom or in the timeline. In some cases, it is being displayed as a "cancellation", but not in the one below where I manually killed the pod.

Looks like a failure 👇

CleanShot 2021-10-14 at 14 08 13@2x

But in reality👇

image