Open ssarbadh opened 2 weeks ago
This issue is currently awaiting triage.
If Karpenter contributors determines this is a relevant issue, they will accept it by applying the triage/accepted
label and provide further guidance.
The triage/accepted
label can be added by org members by writing /triage accepted
in a comment.
I could fix it by stitching things from other repos, issue comments and documentation -
SQS stack needs to be present - Ref - https://github.com/aws/karpenter-provider-aws/blob/main/website/content/en/docs/getting-started/getting-started-with-karpenter/cloudformation.yaml
SQS permission needs to be given to Karpenter pod's service Account - Ref- https://github.com/aws/karpenter-provider-aws/issues/3185#issuecomment-1380648503
If this can be added to documentation or the --set "settings.interruptionQueue=${CLUSTER_NAME}" \
removed
That will help. Thanks
Using ARM SPOT instances and I can verify the following versions that the issue exist:
@ssarbadh What version of karpenter chart is being used here?
I temporarily resolved the issue by removing the following lines:
- name: INTERRUPTION_QUEUE
value: "<cluster-name>"
This prevented errors related to the non-existent interruption queue and allowed the pod to start as expected.
Description
Observed Behavior:
Pod fails to start - panic: AWS.SimpleQueueService.NonExistentQueue: The specified queue does not exist.
Expected Behavior: Pod runs
Reproduction Steps (Please include YAML): Follow this documentation. https://karpenter.sh/docs/getting-started/migrating-from-cas/
Doc mentions about setting a Interruption Queue -
--set "settings.interruptionQueue=${CLUSTER_NAME}"
But the policy for the service account - doesn't mention anything to do with Queue (sqs permissions).
Extra info A service account is created -
Deployment refers to queue
Policy attached to the Service Account is copied from documentation
This issue comment mentions some SQS permissions - https://github.com/aws/karpenter-provider-aws/issues/3185#issuecomment-1380648503
Versions:
Chart Version:
Kubernetes Version (
kubectl version
): 1.29Please vote on this issue by adding a 👍 reaction to the original issue to help the community and maintainers prioritize this request
Please do not leave "+1" or "me too" comments, they generate extra noise for issue followers and do not help prioritize the request
If you are interested in working on this issue or have submitted a pull request, please leave a comment