kubernetes-sigs / cluster-api-provider-aws

Kubernetes Cluster API Provider AWS provides consistent deployment and day 2 operations of "self-managed" and EKS Kubernetes clusters on AWS.
http://cluster-api-aws.sigs.k8s.io/
Apache License 2.0
644 stars 569 forks source link

`iamauth` controller failing to reconcile due to informer issue #5128

Open alam0rt opened 1 month ago

alam0rt commented 1 month ago

/kind bug

What steps did you take and what happened: [A clear and concise description of what the bug is.]

The capa client-go is failing to list v1beta1.MachineDeployments which is in turn causing the iamauth controller to error out before it can reconcile.

  - lastTransitionTime: "2024-09-06T04:09:56Z"
    message: 'getting roles for remote workers: failed to get roles from machine deployments
      failed to list machine deployments for cluster default/manager: Timeout: failed
      waiting for *v1beta1.MachineDeployment Informer to sync'
    reason: IAMAuthenticatorConfigurationFailed

The cluster in question does not use any machine deployments (its an EKS fargate cluster).

What did you expect to happen:

For the iamauth controller to get an empty list of machine deployments and continue reconciliation.

Anything else you would like to add: [Miscellaneous information that will assist in solving the issue.]

Appears to be occurring here: https://github.com/kubernetes-sigs/cluster-api-provider-aws/blob/c2c96dd3d11d57cd2dfa4faaf28f7476c4a2a005/pkg/cloud/services/iamauth/reconcile.go#L125-L128

Environment:

k8s-ci-robot commented 1 month ago

This issue is currently awaiting triage.

If CAPA/CAPI contributors determines this is a relevant issue, they will accept it by applying the triage/accepted label and provide further guidance.

The triage/accepted label can be added by org members by writing /triage accepted in a comment.

Instructions for interacting with me using PR comments are available [here](https://git.k8s.io/community/contributors/guide/pull-requests.md). If you have questions or suggestions related to my behavior, please file an issue against the [kubernetes-sigs/prow](https://github.com/kubernetes-sigs/prow/issues/new?title=Prow%20issue:) repository.
aniruddha2000 commented 1 week ago

/assign