Open agardnerIT opened 2 years ago
Hi Adam,
I understand that this is inconvenient right now, but we can't fix it easily. We recently merged PR https://github.com/keptn-contrib/job-executor-service/pull/249, which at least reports the correct error afterwards, but you will still run into the timeout.
The best way to fix this would be to refactor the Kubernetes job implementation (see https://github.com/keptn-contrib/job-executor-service/issues/244).
For now, you could lower the timeout by setting maxPollDuration to something like 60 seconds: https://github.com/keptn-contrib/job-executor-service/blob/main/FEATURES.md#poll-duration
Any pod where the init container fails will never be able to start. JES should fail quickly and not wait for the timeout.
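To make the "fail quickly" idea concrete, here is a minimal sketch of the kind of check a poller could run on each iteration instead of waiting for the full timeout. It is not JES code. The function name init_container_failure and the set FATAL_WAITING_REASONS are hypothetical, and which waiting reasons count as unrecoverable is an assumption. The dictionary keys follow the Kubernetes Pod status JSON (initContainerStatuses, state.terminated.exitCode, state.waiting.reason).

```python
from typing import Optional

# Assumption: waiting reasons treated as unrecoverable for a one-shot job pod.
# Adjust this set to match what the cluster actually reports.
FATAL_WAITING_REASONS = {"CreateContainerConfigError", "CrashLoopBackOff"}


def init_container_failure(pod_status: dict) -> Optional[str]:
    """Return a short failure description, or None if no init container has failed.

    pod_status is the "status" object of a Pod, as a plain dict.
    """
    for status in pod_status.get("initContainerStatuses") or []:
        name = status.get("name", "<unknown>")
        state = status.get("state") or {}

        # A terminated init container with a non-zero exit code has failed.
        terminated = state.get("terminated")
        if terminated and terminated.get("exitCode", 0) != 0:
            return f"init container {name} exited with code {terminated['exitCode']}"

        # A waiting init container with a fatal reason is assumed not to recover.
        waiting = state.get("waiting")
        if waiting and waiting.get("reason") in FATAL_WAITING_REASONS:
            return f"init container {name} is stuck: {waiting['reason']}"

    return None
```

A polling loop could call this on every status refresh and abort the job as soon as it returns a non-None value, rather than sleeping until maxPollDuration is reached.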
Recreate Issue
Try to run a pod as root:
Which results in:
Impact
Further tasks are blocked until the timeout expires (5 minutes by default).