Closed: sanjinp closed this issue 3 months ago
This issue has been automatically marked as stale because it has not had recent activity. It will be closed in 7 days if no further activity occurs. Thank you for your contributions.
This issue has been automatically closed due to inactivity.
This one hasn't been addressed yet. Unfortunately, I cannot reopen it.
Report
We have ScaledJobs configured with queueLength: "1" and pendingPodConditions.
However, we have noticed that if some of the Jobs fail (reach backoffLimit), Keda starts counting those Pods towards Pending and never triggers new Pods.
Expected Behavior
Keda should trigger additional Pods even if some of the previous Jobs have failed, and should not count the failed ones as Pending.
Actual Behavior
Keda is counting Failed Jobs as Pending
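To illustrate why this blocks scaling, here is a simplified model in Python. It is not KEDA's actual implementation: the function `jobs_to_create` and the rule "target minus pending" are assumptions made for illustration, with `target` standing in for the number of Jobs the queue length calls for.

```python
def jobs_to_create(target: int, pending: int) -> int:
    """Hypothetical scaling rule: start enough Jobs to reach the target,
    minus those already counted as pending. Not KEDA's real code."""
    return max(0, target - pending)


# One message in the queue with queueLength "1" gives a target of one Job.
target = 1

# A Job that hit backoffLimit is still present and is miscounted as pending.
blocked = jobs_to_create(target, pending=1)

# If the Failed Job were excluded from the pending count, a new Job would start.
expected = jobs_to_create(target, pending=0)

print(blocked, expected)
```

Under this model, a single lingering Failed Job is enough to hold the number of new Jobs at zero until it is cleared.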
Steps to Reproduce the Problem
Once backoffLimit expires, the Failed Job is not cleared immediately, causing Keda to count it as Pending.
Logs from KEDA operator
and from kubectl:
KEDA Version
2.11.1
Kubernetes Version
1.24
Platform
Amazon Web Services
Scaler Details
AWS SQS
Anything else?
Manually clearing the Jobs lets Keda trigger new Pods based on the SQS queue message count. As a workaround, we have implemented:
This makes sure that Jobs are cleared regardless of whether they Succeeded or Failed, but it adds another 30s of scaling delay when it happens, slowing our system's reaction time.
Question: when we are using pendingPodConditions, would it be possible to also check the Pod status [Running|Failed|...] alongside the other conditions and, if any Pods are Failed, deduct them from the Pending Pod count? Or should we do that somewhere else? Thanks!
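A minimal sketch of the check proposed in the question, written in Python for illustration only. The `Pod` class and the `is_pending` and `count_pending` helpers are hypothetical and do not reflect KEDA's code. The sketch assumes a Pod is pending while any of the configured pendingPodConditions is not yet true, and adds one rule: a Pod in a terminal phase (Succeeded or Failed) is never counted as pending.

```python
from dataclasses import dataclass, field
from typing import Dict, List

# Kubernetes Pod phases after which a Pod will never start running.
TERMINAL_PHASES = {"Succeeded", "Failed"}


@dataclass
class Pod:
    """Hypothetical, stripped-down view of a Pod's status."""
    phase: str
    conditions: Dict[str, bool] = field(default_factory=dict)


def is_pending(pod: Pod, pending_pod_conditions: List[str]) -> bool:
    # Proposed extra check: terminal Pods are excluded from the pending count.
    if pod.phase in TERMINAL_PHASES:
        return False
    # Assumed rule: pending until every configured condition is true.
    return not all(pod.conditions.get(c, False) for c in pending_pod_conditions)


def count_pending(pods: List[Pod], pending_pod_conditions: List[str]) -> int:
    return sum(is_pending(p, pending_pod_conditions) for p in pods)


pods = [
    Pod("Running", {"PodScheduled": True, "Ready": True}),
    Pod("Pending", {"PodScheduled": True}),   # not Ready yet
    Pod("Failed", {"PodScheduled": True}),    # Job reached backoffLimit
]
pending = count_pending(pods, ["Ready", "PodScheduled"])
print(pending)  # prints 1: only the Pod that is still starting up
```

Without the terminal-phase check, the Failed Pod in this example would also be counted, because its Ready condition is never true.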