Controller should fail fast(er).

aws / aws-application-networking-k8s

A Kubernetes controller for Amazon VPC Lattice

Apache License 2.0

162 stars 47 forks source link

Our team got hit by https://github.com/aws/aws-application-networking-k8s/issues/658 today. The proposal in https://github.com/aws/aws-application-networking-k8s/issues/659 would help a lot.

Additionally, I think that the controller should fail fast(er).

We use helm to install the controller, with the atomic: true option set; the rationale is that if the pods can't become ready, helm rolls back to the previous release.

Currently, the controller will become ready, but fail after a couple of minutes and go into CrashLoopBackOff.

Having the controller check for pre-requisites before becoming ready would prevent this behavior.

aws / aws-application-networking-k8s

Controller should fail fast(er). #660