Open bitnik opened 4 days ago
Please note that MachineHealthChecks currently only support Machines that are owned by a MachineSet or a KubeadmControlPlane.
I missed this important part in the documentation. Still it is confusing though why it shows Ready as True.
Hello, first of all thanks a lot for your work!
I am quite new both to Cluster API and Talos. While I was playing around with
MachineHealthCheck
s, I realised that a control plane node shows all conditions as True even it can't join the etcd and not ready.There are 3 CP nodes:
There are 2 etcd memberes:
etcd logs of
test-cp-v1-8-3-8sqbv
is full of following logsetcd logs of leader:
Because it is permanently removed, it can't join and be rejected again and again. This is happening sometimes during my tests and the solution is just replacing the node. For this purpose I wanted to configure a MachineHealthCheck, but then I realized that is never detected as unhealthy because all conditions show True: