Open Shaked opened 3 years ago
A known problem is Istio sidecars: https://istio.io/latest/docs/ops/common-problems/injection/#cluster-is-not-scaled-down-automatically
We need to add the annotation "cluster-autoscaler.kubernetes.io/safe-to-evict": "true"
to all pods that have an Istio sidecar and are safe to evict.
We could add this to known services for users. Contributions are welcome!
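
For anyone who wants to try this, here is a minimal sketch of the annotation step in Python. It assumes the pod manifest has already been loaded into a plain dict (for example from YAML) and that the sidecar container uses Istio's default name, istio-proxy. The helper names are hypothetical, not part of Kubeflow or Istio. For a Deployment or StatefulSet, the same annotation would go under spec.template.metadata.annotations instead.

```python
import copy

SAFE_TO_EVICT = "cluster-autoscaler.kubernetes.io/safe-to-evict"
SIDECAR_NAME = "istio-proxy"  # assumption: default name of the injected sidecar


def has_istio_sidecar(pod):
    """Return True if the pod manifest lists a container named istio-proxy."""
    containers = pod.get("spec", {}).get("containers") or []
    return any(c.get("name") == SIDECAR_NAME for c in containers)


def mark_safe_to_evict(pod):
    """Return a copy of the pod manifest, annotated if it has an Istio sidecar.

    Hypothetical helper: whether a pod is actually safe to evict is still
    a per-service decision that this function cannot make for you.
    """
    patched = copy.deepcopy(pod)
    if has_istio_sidecar(patched):
        metadata = patched.setdefault("metadata", {})
        annotations = metadata.get("annotations") or {}
        # Annotation values must be strings, so "true" and not True.
        annotations[SAFE_TO_EVICT] = "true"
        metadata["annotations"] = annotations
    return patched


example_pod = {
    "metadata": {"name": "demo", "annotations": None},
    "spec": {"containers": [{"name": "app"}, {"name": "istio-proxy"}]},
}
patched_pod = mark_safe_to_evict(example_pod)
```

The function returns a copy rather than mutating its input, so the original manifest can still be diffed against the patched one before applying it.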
While using Kubeflow 1.0, 1.2, and 1.3, I have noticed that nodes sometimes do not scale down.
AFAIU this happens because of node auto-provisioning: nodes are scaled up, and in some cases kube-system pods start running on them, which prevents those nodes from scaling down.
https://github.com/kubernetes/autoscaler/issues/2377#issuecomment-618275429
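
To check whether this is what is happening on a given node, a rough sketch like the one below can help. It assumes the pod list comes from the "items" array of `kubectl get pods -A -o json`, and it applies a simplified version of the autoscaler's rule: it flags kube-system pods on the node that are not owned by a DaemonSet. It does not account for PodDisruptionBudgets or mirror pods, so treat the result as a list of candidates, not a verdict. The function name is hypothetical.

```python
def blocking_system_pods(pods, node_name):
    """List names of kube-system pods on node_name not owned by a DaemonSet.

    Simplified heuristic (assumption): the real cluster autoscaler also
    considers PodDisruptionBudgets, mirror pods, and other conditions.
    """
    blockers = []
    for pod in pods:
        meta = pod.get("metadata", {})
        if meta.get("namespace") != "kube-system":
            continue
        if pod.get("spec", {}).get("nodeName") != node_name:
            continue
        owners = meta.get("ownerReferences") or []
        if any(owner.get("kind") == "DaemonSet" for owner in owners):
            continue  # DaemonSet pods do not block scale-down
        blockers.append(meta.get("name"))
    return blockers


# Made-up pods for illustration only.
sample_pods = [
    {"metadata": {"name": "dns-1", "namespace": "kube-system",
                  "ownerReferences": [{"kind": "ReplicaSet"}]},
     "spec": {"nodeName": "node-a"}},
    {"metadata": {"name": "proxy-1", "namespace": "kube-system",
                  "ownerReferences": [{"kind": "DaemonSet"}]},
     "spec": {"nodeName": "node-a"}},
    {"metadata": {"name": "notebook-1", "namespace": "kubeflow"},
     "spec": {"nodeName": "node-a"}},
]
candidates = blocking_system_pods(sample_pods, "node-a")
```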
Not sure if this is relevant, but maybe these lines need an update?
https://github.com/kubeflow/gcp-blueprints/blob/1d41c6ca7fc904d91dfcfb44e61e42435801e72c/kubeflow/common/cluster/upstream/cluster.yaml#L32-L37
Currently I'm considering disabling node auto-provisioning, although it would be nice to have it working as expected.
Any ideas on how to fix this?