Open tobiscr opened 2 months ago
Gardener exposes a helper method for detecting non-retryable errors. https://github.com/gardener/gardener/blob/539913d05582f88b80ea99cc53f2487aebaeeeab/pkg/apis/core/v1beta1/helper/errors.go#L181-L196
ERR_INFRA_RESOURCES_DEPLETED
and ERR_CLEANUP_CLUSTER_RESOURCES
where documentation neither suggests that they are retryable or not, but I decided to treat them as retryables as Gardener's HasNonRetryableErrorCode
does not consider those two error codes.
Description
A Gardener can end with a recoverable error (e.g. quota exceeded). We have to define on Provisioner side how we treat these errors. Potential errors are (see Gardener docs) listed in the appendix section.
We have to verify which error codes KIM will treat as recoverable and leave the
RuntimeCR
in anPending
state. Other error codes will be treated as non-recoverable and theRuntimeCR
will be set to statusError
.AC:
recoverable
andnon-recoverable
errors and share the result with the teamAppendix
ERR_INFRA_UNAUTHENTICATED
ERR_INFRA_UNAUTHORIZED
ERR_INFRA_QUOTA_EXCEEDED
ERR_INFRA_RATE_LIMITS_EXCEEDED
ERR_INFRA_DEPENDENCIES
ERR_RETRYABLE_INFRA_DEPENDENCIES
ERR_INFRA_RESOURCES_DEPLETED
ERR_CLEANUP_CLUSTER_RESOURCES
ERR_CONFIGURATION_PROBLEM
ERR_RETRYABLE_CONFIGURATION_PROBLEM
ERR_PROBLEMATIC_WEBHOOK
Reasons
React on error cases reported by Gardener and filter for recoverable / non-recoverable cases.
Attachments