Open mladedav opened 2 years ago
This issue has not had any activity in the past 30 days, so the stale label has been added to it.
The stale label will be removed if there is new activity.
Add the keepalive label to exempt this issue from the stale check action.
Thank you for your contributions!
This seems like a genuine concern. I'm not sure KubeMemoryOvercommit is the right alert for this, but some alert that says "cluster is unable to schedule all pods" sounds useful.
We had a cluster where someone had cordoned a few nodes, and we only found out when a node was restarted. I think cordoned nodes should be taken into account by the alert, because with their capacity unavailable a single node failure may leave the cluster unable to schedule all pods.
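
To make the concern concrete, here is a rough sketch of the check I have in mind. It is not the actual alert expression. The `Node` type and `tolerates_node_failure` function are made up for illustration. It makes the conservative assumption that cordoned capacity cannot absorb any pods at all, although pods already running on a cordoned node do keep running.

```python
from dataclasses import dataclass

GIB = 1024 ** 3


@dataclass
class Node:
    # Hypothetical model of a node, not a Kubernetes API type.
    name: str
    allocatable_bytes: int
    cordoned: bool = False


def tolerates_node_failure(nodes, requested_bytes):
    """Return True if the requests still fit after losing the largest schedulable node."""
    # Cordoned nodes accept no new pods, so their capacity is ignored here.
    schedulable = [n.allocatable_bytes for n in nodes if not n.cordoned]
    if not schedulable:
        return False
    # Capacity left if the largest schedulable node goes away.
    remaining = sum(schedulable) - max(schedulable)
    return requested_bytes <= remaining


if __name__ == "__main__":
    nodes = [Node(f"node-{i}", 16 * GIB) for i in range(4)]
    print(tolerates_node_failure(nodes, 40 * GIB))
    nodes[0].cordoned = True
    nodes[1].cordoned = True
    print(tolerates_node_failure(nodes, 40 * GIB))
```

With four equal nodes the requests fit even after one failure, but once two of them are cordoned the same workload no longer does, which is the situation we ran into.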