helm / kind-action

A GitHub Action for Kubernetes IN Docker - local clusters for testing Kubernetes
https://kind.sigs.k8s.io/
Apache License 2.0
306 stars 76 forks source link

post-action clean up job occasional failed on self-hosted github action runners #54

Closed jerry153fish closed 9 months ago

jerry153fish commented 2 years ago

I have seen the cleanup jobs fails on our ci for multiple times in our self-hosted github runners

ERROR: failed to delete cluster "local": failed to delete nodes: command "docker rm -f -v local-control-plane" failed with error: exit status 1

Command Output: Error response from daemon: Could not kill running container 25a65[4] xxxx cannot remove - tried to kill container, but did not receive an exit event

This might be caused by https://github.com/kubernetes-sigs/kind/issues/708.

Is it possible to add an option input failed_on_cleanup to force kind delete to return ture eg:

   kind delete xxx || true # if failed_on_cleanup set to false
bianchi2 commented 2 years ago

I see the same issue, though in my case it's related to types of workloads deployed to the cluster. To mitigate it I delete workloads and restart docker too.

Chinwendu20 commented 1 year ago

Yeah I am having the same issue