DataDog / extendeddaemonset

Kubernetes Extended Daemonset controller
Apache License 2.0
98 stars 13 forks source link

[CONTINT-3950] Fix rollout blocked on canary failure #187

Closed AliDatadog closed 2 months ago

AliDatadog commented 3 months ago

What does this PR do?

This PR changes the way we count unready pods during the rollout. Instead of looking at an annotation that was created with the new replicaset, we look at the number of unready pods among the old pods.

Motivation

As part of https://github.com/DataDog/extendeddaemonset/pull/166, we enhanced the EDS to keep track of unhealthy pods before a deployment by adding an annotation on the new ERS. However, it does not cover the case of canary failures. The annotation is never updated which means that the number of unreadyPods could be 0.

Additional Notes

N/A

Describe your test plan

AliDatadog commented 2 months ago

/merge

dd-devflow[bot] commented 2 months ago

:x: MergeQueue

You are not allowed to use the merge queue towards main.

If you need support, contact us on Slack #devflow with those details!