metal3-io / baremetal-operator

Bare metal host provisioning integration for Kubernetes
Apache License 2.0
592 stars 253 forks source link

[flake] Docker restart fails in BMO e2e #1783

Open tuminoid opened 5 months ago

tuminoid commented 5 months ago

Which jobs are flaking?

BMO e2e tests

Which tests are flaking?

Randomly any of e2e variants

Since when has it been flaking?

Since moving to GH actions.

Jenkins link

https://github.com/metal3-io/baremetal-operator/actions/runs/9477201984/job/26111334840

Reason for failure (if possible)

n 12 05:29:25 minikube systemd[1]: docker.service: Deactivated successfully.
Jun 12 05:29:25 minikube systemd[1]: Stopped Docker Application Container Engine.
Jun 12 05:29:25 minikube systemd[1]: Starting Docker Application Container Engine...
Jun 12 05:29:25 minikube dockerd[796]: time="2024-06-12T05:29:25.668001724Z" level=info msg="Starting up"
Jun 12 05:30:25 minikube dockerd[796]: failed to start daemon: failed to dial "/run/containerd/containerd.sock": failed to dial "/run/containerd/containerd.sock": context deadline exceeded
Jun 12 05:30:25 minikube systemd[1]: docker.service: Main process exited, code=exited, status=1/FAILURE
Jun 12 05:30:25 minikube systemd[1]: docker.service: Failed with result 'exit-code'.
Jun 12 05:30:25 minikube systemd[1]: Failed to start Docker Application Container Engine.

Anything else we need to know?

No response

Label(s) to be applied

/kind flake

Rozzii commented 5 months ago

/triage accepted

metal3-io-bot commented 2 months ago

Issues go stale after 90d of inactivity. Mark the issue as fresh with /remove-lifecycle stale. Stale issues will close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

/lifecycle stale

tuminoid commented 1 month ago

/remove-lifecycle stale /lifecycle frozen

This is very much still happening.