Azure / OpenShift

Azure Red Hat OpenShift
https://docs.microsoft.com/azure/openshift/intro-openshift
MIT License

Consider including MachineHealthChecks as part of cluster provisioning. #221

Open erzhan46 opened 3 years ago

erzhan46 commented 3 years ago

Consider including a MachineHealthCheck as part of cluster provisioning. The MachineHealthCheck API can help automatically remediate some issues with machines. See the following for more information: https://docs.openshift.com/container-platform/4.6/machine_management/deploying-machine-health-checks.html

jboutaud commented 3 years ago

Hi, can you provide some additional details about which actions or operations you would like MachineHealthCheck to help with?

erzhan46 commented 3 years ago

In general, this capability would contribute to the 'managed' nature of ARO by introducing self-healing at the Machine API level. Problem detection uses the Kubernetes node-problem-detector. The default remediation is to delete the machine (VM) and let the Machine API create a new Machine. Examples: a Machine is in phase 'Failed', or a Machine has no corresponding Node. See the following for more details: https://github.com/wking/openshift-enhancements/blob/master/enhancements/machine-api/machine-health-checking.md
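
To make the two example cases concrete (phase 'Failed', no corresponding Node), here is a minimal Python sketch of the decision a health check makes before deleting a machine. It is an illustration only, not the Machine API implementation: the `Machine` class, its fields, and both function names are hypothetical, and the sketch ignores the timeouts a real health check would apply before treating a machine as unhealthy.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Machine:
    # Hypothetical, simplified stand-in for a Machine API object.
    name: str
    phase: str                # e.g. "Running" or "Failed"
    node_name: Optional[str]  # None when no Node corresponds to this Machine


def needs_remediation(machine: Machine) -> bool:
    """Return True for the two example cases: phase 'Failed' or no Node."""
    if machine.phase == "Failed":
        return True
    if machine.node_name is None:
        return True
    return False


def machines_to_delete(machines: List[Machine]) -> List[str]:
    """Names of machines that the default remediation would delete,
    leaving it to the Machine API to create replacements."""
    return [m.name for m in machines if needs_remediation(m)]


if __name__ == "__main__":
    fleet = [
        Machine("worker-a", "Running", "node-a"),
        Machine("worker-b", "Failed", "node-b"),
        Machine("worker-c", "Running", None),
    ]
    print(machines_to_delete(fleet))
```

Only `worker-b` and `worker-c` are selected here; a healthy machine with a registered Node is left alone.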