Open kurekbharath opened 1 year ago
Can someone provide input on this issue? Is this expected behavior or some config I have done is wrong ? or does This need to fixed from code side?
this is expected behavior, from https://fluxcd.io/flagger/usage/webhooks/:
confirm-promotion hooks are executed before the promotion step. The canary promotion is paused until the hooks return HTTP 200. While the promotion is paused, Flagger will continue to run the metrics checks and rollout hooks.
if you want to rollback, then specify another webhook of type rollback
and make the webhook server return a response with a 2xx status code after the Canary is stuck at WaitingPromotion
after your desired timeout.
@aryan9600 If canary stuck in state Promoting
, how do I make Canary fail ?
Canary status stuck in WaitingPromotion for a long duration.
Canary status is stuck in WaitingPromotion status for more than hours with the message Halt sampleapp.testnamespace advancement waiting for promotion approval pre-rollout, where in the canary manifest, we have mentioned timeout for 2-3 min, even after 2-3 min if webhook doesn't return 200 response we expect the canary status to mark as failed. But the canary status is stuck in WaitingPromotion status.
I have tried to use webhook of the type confirm-promotion and pre-rollout for this test testing still status is stuck on WaitingPromotion status.
To Reproduce
Deploy a new change. Once the canary load test is successful, the webhook return 200 then roll out the changes to primary pods(Its working) If the webhook doesn't return 200 within a certain period of time(2m timeout we set in our case), it should timeout and mark the canary as failed status. (NOt working)
Below is the sample canary yaml file used
Below is the canary event
Expected behavior
If webhook doesn't return 200 without timeout set mark the canary as failed status