Closed fullykubed closed 1 month ago
@fullykubed would this increase cost for users running kube_fledged?
@wesbragagt I am still collecting cost data to determine the impact, but I suspect it has an impact.
For this and a few other reasons, we are likely to going to fork the kube-fledged project and manage a custom version ourself that plays nicer with modern cluster components (kube-fledged is unmaintained it seems). Our goal is to have that integrated by the next stable release.
This is resolved with introduction of kyverno.
Prior Search
What happened?
Kube-fledged periodically runs pods on every node that attempt to pull images to ensure that node's image cache is up to date. This runs every 3 minutes in the current stack configuration.
However, while these pods are running, Karpenter cannot disrupt the nodes because the kube-fledged pods are bound to their nodes and cannot be rescheduled on different nodes (a requirement of karpeneter scale-down). Since kube-fledged runs so often, these often leaves Karpenter perpetually unable to disrupt nodes.
The challenge is that the kube-fledged sync does not run automatically on new node creation so unless the sync runs often its possible a node might not have images in its image cache when needed.
Personally, it seems like we might need to fork kube-fledged to add this capability since the project seems relatively unmaintained.
Steps to Reproduce
Default behavior of the stack. Simply observe.
Relevant log output