Closed mzameer777 closed 1 month ago
This issue is currently awaiting triage.
If cloud-provider-aws contributors determine this is a relevant issue, they will accept it by applying the triage/accepted
label and provide further guidance.
The triage/accepted
label can be added by org members by writing /triage accepted
in a comment.
You're probably getting the timeout here: https://github.com/kubernetes/cloud-provider-aws/blob/d7e05d57709cd46297490b51ce0dd11a54dbea35/cmd/ecr-credential-provider/main.go#L139
Do your nodes have network access to the ECR endpoint?
(the warning you're seeing in the output there is misleading but harmless. Fix for that in #1030)
I can confirm that the node has network connectivity to ECR VPC endpoint and it also has ECR full permissions.
What else can I look for, is there a way to debug this further in my env?
You verified that aws ecr get-login-password
works on the node?
You can try to reproduce the cred provider failure with something like:
echo '{"kind":"CredentialProviderRequest","apiVersion":"credentialprovider.kubelet.k8s.io/v1","image":"$IMAGE"}' | ecr-credential-provider
I was able to resolve this, my cluster was not having connectivity to ecr api. Thanks for helping me debug this.
What happened: I'm trying to implement image pull from private ECR, I have installed and configured ecr-credential-provider plugin, I'm getting this error in kubelet logs, and can't figure out how to proceed further
The plugin binary is executed and it says context deadline exceeded.
below is my configuration, I'm using Talos, so this is the creds config patch
Environment:
kubectl version
):Client Version: v1.31.1 Kustomize Version: v5.4.2 Server Version: v1.30.3
/kind bug