Closed eliaoggian closed 4 years ago
I found out the problem was the huge size of the DB due to never compacting and defragging it. This helped resolve the issue: https://github.com/etcd-io/etcd/blob/a621d807f061e1dd635033a8d6bc261461429e27/Documentation/op-guide/maintenance.md#space-quota
Issue
I deleted one node and as a consequence an etcd-cluster pod that was running on it. The pod is now recreating on another node but the readinessProbe fails, therefore the pod gets killed after reaching the timeout, and created again and again and again.
The member is being added to the cluster and removed correctly though.
The cluster is unable to reach the desired state of 5 ready nodes.
Versions
Values.yaml used to install the chart
Logs
I thought that this could be related to #2109
Any help is appreciated.
Thanks