Closed albertschwarzkopf closed 1 year ago
Can you please post here the logs from a crash looping controller, the startup part? I guess AWS CNI wrongly blocks access to the API server, we do have an egress allow all policy.
@stefanprodan Hi thanks for the reply. Here are the logs.
helm-controller:
┌──────────────────────────────────────────────────────────────────────────────────── Logs(flux-system/helm-controller-58d6476fdf-scx46:manager)[tail] ────────────────────────────────────────────────────────────────────────────────────┐
│ Autoscroll:On FullScreen:Off Timestamps:Off Wrap:Off │
│ {"level":"info","ts":"2023-08-31T12:56:15.568Z","logger":"controller-runtime.metrics","msg":"Metrics server is starting to listen","addr":":8080"} │
│ {"level":"info","ts":"2023-08-31T12:56:15.570Z","logger":"setup","msg":"starting manager"} │
│ {"level":"info","ts":"2023-08-31T12:56:15.570Z","msg":"Starting server","kind":"health probe","addr":":9440"} │
│ {"level":"info","ts":"2023-08-31T12:56:15.671Z","msg":"starting server","path":"/metrics","kind":"metrics","addr":":8080"} │
│ {"level":"info","ts":"2023-08-31T12:56:15.671Z","logger":"runtime","msg":"attempting to acquire leader lease flux-system/helm-controller-leader-election...\n"} │
│ {"level":"info","ts":"2023-08-31T12:56:15.680Z","logger":"runtime","msg":"successfully acquired lease flux-system/helm-controller-leader-election\n"} │
│ {"level":"info","ts":"2023-08-31T12:56:15.681Z","msg":"Starting EventSource","controller":"helmrelease","controllerGroup":"helm.toolkit.fluxcd.io","controllerKind":"HelmRelease","source":"kind source: *v2beta1.HelmRelease"} │
│ {"level":"info","ts":"2023-08-31T12:56:15.681Z","msg":"Starting EventSource","controller":"helmrelease","controllerGroup":"helm.toolkit.fluxcd.io","controllerKind":"HelmRelease","source":"kind source: *v1beta2.HelmChart"} │
│ {"level":"info","ts":"2023-08-31T12:56:15.681Z","msg":"Starting Controller","controller":"helmrelease","controllerGroup":"helm.toolkit.fluxcd.io","controllerKind":"HelmRelease"} │
│ {"level":"info","ts":"2023-08-31T12:56:15.784Z","msg":"Starting workers","controller":"helmrelease","controllerGroup":"helm.toolkit.fluxcd.io","controllerKind":"HelmRelease","worker count":4} │
│ {"level":"info","ts":"2023-08-31T12:56:44.587Z","msg":"Stopping and waiting for non leader election runnables"} │
│ {"level":"info","ts":"2023-08-31T12:56:44.587Z","msg":"shutting down server","path":"/metrics","kind":"metrics","addr":":8080"} │
│ {"level":"info","ts":"2023-08-31T12:56:44.587Z","msg":"Stopping and waiting for leader election runnables"} │
│ {"level":"info","ts":"2023-08-31T12:56:44.587Z","msg":"Shutdown signal received, waiting for all workers to finish","controller":"helmrelease","controllerGroup":"helm.toolkit.fluxcd.io","controllerKind":"HelmRelease"}
kustomize-controller:
┌───────────────────────────────────────────────────────────────────────────────── Logs(flux-system/kustomize-controller-54987bd455-2448x:manager)[tail] ──────────────────────────────────────────────────────────────────────────────────┐
│ Autoscroll:On FullScreen:Off Timestamps:Off Wrap:Off │
│ {"level":"info","ts":"2023-08-31T12:55:45.989Z","logger":"controller-runtime.metrics","msg":"Metrics server is starting to listen","addr":":8080"} │
│ {"level":"info","ts":"2023-08-31T12:55:45.991Z","logger":"setup","msg":"starting manager"} │
│ {"level":"info","ts":"2023-08-31T12:55:45.991Z","msg":"Starting server","kind":"health probe","addr":":9440"} │
│ {"level":"info","ts":"2023-08-31T12:55:46.091Z","msg":"starting server","path":"/metrics","kind":"metrics","addr":":8080"} │
│ {"level":"info","ts":"2023-08-31T12:55:46.092Z","logger":"runtime","msg":"attempting to acquire leader lease flux-system/kustomize-controller-leader-election...\n"} │
│ {"level":"info","ts":"2023-08-31T12:55:46.119Z","logger":"runtime","msg":"successfully acquired lease flux-system/kustomize-controller-leader-election\n"} │
│ {"level":"info","ts":"2023-08-31T12:55:46.164Z","msg":"Starting EventSource","controller":"kustomization","controllerGroup":"kustomize.toolkit.fluxcd.io","controllerKind":"Kustomization","source":"kind source: *v1.Kustomization"} │
│ {"level":"info","ts":"2023-08-31T12:55:46.165Z","msg":"Starting EventSource","controller":"kustomization","controllerGroup":"kustomize.toolkit.fluxcd.io","controllerKind":"Kustomization","source":"kind source: *v1beta2.OCIRepository │
│ {"level":"info","ts":"2023-08-31T12:55:46.165Z","msg":"Starting EventSource","controller":"kustomization","controllerGroup":"kustomize.toolkit.fluxcd.io","controllerKind":"Kustomization","source":"kind source: *v1.GitRepository"} │
│ {"level":"info","ts":"2023-08-31T12:55:46.166Z","msg":"Starting EventSource","controller":"kustomization","controllerGroup":"kustomize.toolkit.fluxcd.io","controllerKind":"Kustomization","source":"kind source: *v1beta2.Bucket"} │
│ {"level":"info","ts":"2023-08-31T12:55:46.166Z","msg":"Starting Controller","controller":"kustomization","controllerGroup":"kustomize.toolkit.fluxcd.io","controllerKind":"Kustomization"} │
│ {"level":"info","ts":"2023-08-31T12:55:46.269Z","msg":"Starting workers","controller":"kustomization","controllerGroup":"kustomize.toolkit.fluxcd.io","controllerKind":"Kustomization","worker count":4} │
│ {"level":"info","ts":"2023-08-31T12:55:46.269Z","msg":"Dependencies do not meet ready condition, retrying in 30s","controller":"kustomization","controllerGroup":"kustomize.toolkit.fluxcd.io","controllerKind":"Kustomization","Kustomi │
│ {"level":"info","ts":"2023-08-31T12:55:46.270Z","msg":"Dependencies do not meet ready condition, retrying in 30s","controller":"kustomization","controllerGroup":"kustomize.toolkit.fluxcd.io","controllerKind":"Kustomization","Kustomi │
│ {"level":"info","ts":"2023-08-31T12:55:46.270Z","msg":"Dependencies do not meet ready condition, retrying in 30s","controller":"kustomization","controllerGroup":"kustomize.toolkit.fluxcd.io","controllerKind":"Kustomization","Kustomi │
│ {"level":"info","ts":"2023-08-31T12:55:46.270Z","msg":"Dependencies do not meet ready condition, retrying in 30s","controller":"kustomization","controllerGroup":"kustomize.toolkit.fluxcd.io","controllerKind":"Kustomization","Kustomi │
│ {"level":"info","ts":"2023-08-31T12:55:46.290Z","msg":"Dependencies do not meet ready condition, retrying in 30s","controller":"kustomization","controllerGroup":"kustomize.toolkit.fluxcd.io","controllerKind":"Kustomization","Kustomi │
│ {"level":"info","ts":"2023-08-31T12:55:46.291Z","msg":"Dependencies do not meet ready condition, retrying in 30s","controller":"kustomization","controllerGroup":"kustomize.toolkit.fluxcd.io","controllerKind":"Kustomization","Kustomi │
│ {"level":"info","ts":"2023-08-31T12:55:46.296Z","msg":"Dependencies do not meet ready condition, retrying in 30s","controller":"kustomization","controllerGroup":"kustomize.toolkit.fluxcd.io","controllerKind":"Kustomization","Kustomi │
│ {"level":"info","ts":"2023-08-31T12:55:46.303Z","msg":"Dependencies do not meet ready condition, retrying in 30s","controller":"kustomization","controllerGroup":"kustomize.toolkit.fluxcd.io","controllerKind":"Kustomization","Kustomi │
│ {"level":"info","ts":"2023-08-31T12:55:46.362Z","msg":"Dependencies do not meet ready condition, retrying in 30s","controller":"kustomization","controllerGroup":"kustomize.toolkit.fluxcd.io","controllerKind":"Kustomization","Kustomi │
│ {"level":"info","ts":"2023-08-31T12:55:46.371Z","msg":"Dependencies do not meet ready condition, retrying in 30s","controller":"kustomization","controllerGroup":"kustomize.toolkit.fluxcd.io","controllerKind":"Kustomization","Kustomi │
│ {"level":"info","ts":"2023-08-31T12:56:14.943Z","msg":"Stopping and waiting for non leader election runnables"} │
│ {"level":"info","ts":"2023-08-31T12:56:14.943Z","msg":"shutting down server","path":"/metrics","kind":"metrics","addr":":8080"} │
│ {"level":"info","ts":"2023-08-31T12:56:14.943Z","msg":"Stopping and waiting for leader election runnables"} │
│ {"level":"info","ts":"2023-08-31T12:56:14.943Z","msg":"Shutdown signal received, waiting for all workers to finish","controller":"kustomization","controllerGroup":"kustomize.toolkit.fluxcd.io","controllerKind":"Kustomization"} │
│ {"level":"info","ts":"2023-08-31T12:56:44.944Z","msg":"Stopping and waiting for caches"} │
│ {"level":"info","ts":"2023-08-31T12:56:44.944Z","msg":"Stopping and waiting for webhooks"} │
│ {"level":"info","ts":"2023-08-31T12:56:44.944Z","msg":"Wait completed, proceeding to shutdown the manager"} │
│ {"level":"error","ts":"2023-08-31T12:56:44.952Z","logger":"setup","msg":"problem running manager","error":"failed waiting for all runnables to end within grace period of 30s: context deadline exceeded"} │
│ Stream closed EOF for flux-system/kustomize-controller-54987bd455-2448x (manager)
source-controlle:
┌───────────────────────────────────────────────────────────────────────────────────── Logs(flux-system/source-controller-f46df59c-gbjnb:manager)[1m] ─────────────────────────────────────────────────────────────────────────────────────┐
│ Autoscroll:On FullScreen:Off Timestamps:Off Wrap:On │
│ {"level":"info","ts":"2023-08-31T13:00:34.581Z","logger":"controller-runtime.metrics","msg":"Metrics server is starting to listen","addr":":8080"} │
│ {"level":"info","ts":"2023-08-31T13:00:34.581Z","logger":"setup","msg":"caching of Helm index files is disabled"} │
│ {"level":"info","ts":"2023-08-31T13:00:34.585Z","logger":"setup","msg":"starting manager"} │
│ {"level":"info","ts":"2023-08-31T13:00:34.585Z","msg":"Starting server","kind":"health probe","addr":":9440"} │
│ {"level":"info","ts":"2023-08-31T13:00:34.685Z","msg":"starting server","path":"/metrics","kind":"metrics","addr":":8080"} │
│ {"level":"info","ts":"2023-08-31T13:00:34.686Z","logger":"runtime","msg":"attempting to acquire leader lease flux-system/source-controller-leader-election...\n"} │
│ {"level":"info","ts":"2023-08-31T13:01:03.364Z","msg":"Stopping and waiting for non leader election runnables"} │
│ {"level":"info","ts":"2023-08-31T13:01:03.364Z","msg":"shutting down server","path":"/metrics","kind":"metrics","addr":":8080"} │
│ {"level":"info","ts":"2023-08-31T13:01:03.364Z","msg":"Stopping and waiting for leader election runnables"} │
│ {"level":"info","ts":"2023-08-31T13:01:03.365Z","msg":"Starting EventSource","controller":"gitrepository","controllerGroup":"source.toolkit.fluxcd.io","controllerKind":"GitRepository","source":"kind source: *v1.GitRepository"} │
│ {"level":"info","ts":"2023-08-31T13:01:03.365Z","msg":"Starting Controller","controller":"gitrepository","controllerGroup":"source.toolkit.fluxcd.io","controllerKind":"GitRepository"} │
│ {"level":"info","ts":"2023-08-31T13:01:03.368Z","msg":"Starting EventSource","controller":"bucket","controllerGroup":"source.toolkit.fluxcd.io","controllerKind":"Bucket","source":"kind source: *v1beta2.Bucket"} │
│ {"level":"info","ts":"2023-08-31T13:01:03.369Z","msg":"Starting Controller","controller":"bucket","controllerGroup":"source.toolkit.fluxcd.io","controllerKind":"Bucket"} │
│ {"level":"info","ts":"2023-08-31T13:01:03.369Z","msg":"Starting EventSource","controller":"helmrepository","controllerGroup":"source.toolkit.fluxcd.io","controllerKind":"HelmRepository","source":"kind source: *v1beta2.HelmRepository │
│ "} │
│ {"level":"info","ts":"2023-08-31T13:01:03.370Z","msg":"Starting Controller","controller":"helmrepository","controllerGroup":"source.toolkit.fluxcd.io","controllerKind":"HelmRepository"} │
│ {"level":"info","ts":"2023-08-31T13:01:03.371Z","msg":"Starting EventSource","controller":"helmchart","controllerGroup":"source.toolkit.fluxcd.io","controllerKind":"HelmChart","source":"kind source: *v1beta2.HelmChart"} │
│ {"level":"info","ts":"2023-08-31T13:01:03.371Z","msg":"Starting EventSource","controller":"helmchart","controllerGroup":"source.toolkit.fluxcd.io","controllerKind":"HelmChart","source":"kind source: *v1beta2.HelmRepository"} │
│ {"level":"info","ts":"2023-08-31T13:01:03.372Z","msg":"Starting EventSource","controller":"helmchart","controllerGroup":"source.toolkit.fluxcd.io","controllerKind":"HelmChart","source":"kind source: *v1.GitRepository"} │
│ {"level":"info","ts":"2023-08-31T13:01:03.372Z","msg":"Starting EventSource","controller":"helmchart","controllerGroup":"source.toolkit.fluxcd.io","controllerKind":"HelmChart","source":"kind source: *v1beta2.Bucket"} │
│ {"level":"info","ts":"2023-08-31T13:01:03.372Z","msg":"Starting Controller","controller":"helmchart","controllerGroup":"source.toolkit.fluxcd.io","controllerKind":"HelmChart"} │
│ {"level":"info","ts":"2023-08-31T13:01:03.374Z","msg":"Starting EventSource","controller":"helmrepository","controllerGroup":"source.toolkit.fluxcd.io","controllerKind":"HelmRepository","source":"kind source: *v1beta2.HelmRepository │
│ "} │
│ {"level":"info","ts":"2023-08-31T13:01:03.374Z","msg":"Starting Controller","controller":"helmrepository","controllerGroup":"source.toolkit.fluxcd.io","controllerKind":"HelmRepository"} │
│ {"level":"info","ts":"2023-08-31T13:01:03.376Z","msg":"Starting EventSource","controller":"ocirepository","controllerGroup":"source.toolkit.fluxcd.io","controllerKind":"OCIRepository","source":"kind source: *v1beta2.OCIRepository"} │
│ {"level":"info","ts":"2023-08-31T13:01:03.376Z","msg":"Starting Controller","controller":"ocirepository","controllerGroup":"source.toolkit.fluxcd.io","controllerKind":"OCIRepository"} │
│ {"level":"error","ts":"2023-08-31T13:01:03.379Z","logger":"controller-runtime.source.EventHandler","msg":"failed to get informer from cache","error":"Timeout: failed waiting for *v1.GitRepository Informer to sync"} │
│ {"level":"info","ts":"2023-08-31T13:01:03.379Z","msg":"Starting workers","controller":"bucket","controllerGroup":"source.toolkit.fluxcd.io","controllerKind":"Bucket","worker count":2} │
│ {"level":"info","ts":"2023-08-31T13:01:03.380Z","msg":"Shutdown signal received, waiting for all workers to finish","controller":"bucket","controllerGroup":"source.toolkit.fluxcd.io","controllerKind":"Bucket"} │
│ {"level":"info","ts":"2023-08-31T13:01:03.381Z","msg":"Starting workers","controller":"helmchart","controllerGroup":"source.toolkit.fluxcd.io","controllerKind":"HelmChart","worker count":2} │
│ {"level":"info","ts":"2023-08-31T13:01:03.382Z","msg":"Shutdown signal received, waiting for all workers to finish","controller":"helmchart","controllerGroup":"source.toolkit.fluxcd.io","controllerKind":"HelmChart"} │
│ {"level":"info","ts":"2023-08-31T13:01:03.382Z","msg":"Starting workers","controller":"helmrepository","controllerGroup":"source.toolkit.fluxcd.io","controllerKind":"HelmRepository","worker count":2} │
│ {"level":"info","ts":"2023-08-31T13:01:03.383Z","msg":"Shutdown signal received, waiting for all workers to finish","controller":"helmrepository","controllerGroup":"source.toolkit.fluxcd.io","controllerKind":"HelmRepository"} │
│ {"level":"info","ts":"2023-08-31T13:01:03.384Z","msg":"Starting workers","controller":"gitrepository","controllerGroup":"source.toolkit.fluxcd.io","controllerKind":"GitRepository","worker count":2} │
│ {"level":"info","ts":"2023-08-31T13:01:03.384Z","msg":"Shutdown signal received, waiting for all workers to finish","controller":"gitrepository","controllerGroup":"source.toolkit.fluxcd.io","controllerKind":"GitRepository"} │
│ {"level":"info","ts":"2023-08-31T13:01:03.384Z","msg":"Starting workers","controller":"helmrepository","controllerGroup":"source.toolkit.fluxcd.io","controllerKind":"HelmRepository","worker count":2} │
│ {"level":"info","ts":"2023-08-31T13:01:03.386Z","msg":"Shutdown signal received, waiting for all workers to finish","controller":"helmrepository","controllerGroup":"source.toolkit.fluxcd.io","controllerKind":"HelmRepository"} │
│ {"level":"info","ts":"2023-08-31T13:01:03.388Z","msg":"Starting workers","controller":"ocirepository","controllerGroup":"source.toolkit.fluxcd.io","controllerKind":"OCIRepository","worker count":2} │
│ {"level":"info","ts":"2023-08-31T13:01:03.388Z","msg":"Shutdown signal received, waiting for all workers to finish","controller":"ocirepository","controllerGroup":"source.toolkit.fluxcd.io","controllerKind":"OCIRepository"} │
│ {"level":"info","ts":"2023-08-31T13:01:03.388Z","msg":"All workers finished","controller":"bucket","controllerGroup":"source.toolkit.fluxcd.io","controllerKind":"Bucket"} │
│ {"level":"info","ts":"2023-08-31T13:01:03.389Z","msg":"All workers finished","controller":"helmchart","controllerGroup":"source.toolkit.fluxcd.io","controllerKind":"HelmChart"} │
│ {"level":"info","ts":"2023-08-31T13:01:03.390Z","msg":"All workers finished","controller":"gitrepository","controllerGroup":"source.toolkit.fluxcd.io","controllerKind":"GitRepository"} │
│ {"level":"info","ts":"2023-08-31T13:01:03.390Z","msg":"All workers finished","controller":"ocirepository","controllerGroup":"source.toolkit.fluxcd.io","controllerKind":"OCIRepository"} │
│ {"level":"error","ts":"2023-08-31T13:01:03.402Z","logger":"controller-runtime.source.EventHandler","msg":"failed to get informer from cache","error":"Timeout: failed waiting for *v1beta2.Bucket Informer to sync"} │
│ {"level":"error","ts":"2023-08-31T13:01:03.404Z","logger":"controller-runtime.source.EventHandler","msg":"failed to get informer from cache","error":"Timeout: failed waiting for *v1beta2.Bucket Informer to sync"} │
│ {"level":"error","ts":"2023-08-31T13:01:03.462Z","logger":"controller-runtime.source.EventHandler","msg":"failed to get informer from cache","error":"Timeout: failed waiting for *v1beta2.OCIRepository Informer to sync"} │
│ {"level":"error","ts":"2023-08-31T13:01:03.465Z","msg":"Reconciler error","controller":"helmrepository","controllerGroup":"source.toolkit.fluxcd.io","controllerKind":"HelmRepository","HelmRepository":{"name":"karpenter","namespace": │
│ "flux-system"},"namespace":"flux-system","name":"karpenter","reconcileID":"77323dd0-7c48-45b2-9ed4-9b09b4b5e505","error":"context canceled","errorCauses":[{"error":"context canceled","errorCauses":[{"error":"context canceled"},{"err │
│ or":"context canceled"}]}]} │
│ {"level":"info","ts":"2023-08-31T13:01:03.468Z","msg":"All workers finished","controller":"helmrepository","controllerGroup":"source.toolkit.fluxcd.io","controllerKind":"HelmRepository"} │
│ {"level":"info","ts":"2023-08-31T13:01:03.881Z","msg":"artifact up-to-date with remote revision: 'sha256:b29bb5693fc36b56aadcc53b2ab6020ac9f3707ffe7365e27b8132070207e51f'","controller":"helmrepository","controllerGroup":"source.tool │
│ kit.fluxcd.io","controllerKind":"HelmRepository","HelmRepository":{"name":"incubator","namespace":"flux-system"},"namespace":"flux-system","name":"incubator","reconcileID":"f9ecb312-89d5-4159-919b-f8a7fe420ef5"} │
│ {"level":"info","ts":"2023-08-31T13:01:04.062Z","msg":"artifact up-to-date with remote revision: 'sha256:1df959b442f64a7e148eacfe72b64358cda5c0c73b9d9f4b52c842e639cadb96'","controller":"helmrepository","controllerGroup":"source.tool │
│ kit.fluxcd.io","controllerKind":"HelmRepository","HelmRepository":{"name":"external-secrets","namespace":"external-secrets"},"namespace":"external-secrets","name":"external-secrets","reconcileID":"dfc7f964-f42b-4241-af24-f2acd80a570 │
│ 5"} │
│ {"level":"info","ts":"2023-08-31T13:01:04.270Z","msg":"artifact up-to-date with remote revision: 'sha256:92273dedebeaa6c78831a238dba1b59e6643ff64bee1258822d67f0f4674ad38'","controller":"helmrepository","controllerGroup":"source.tool │
│ kit.fluxcd.io","controllerKind":"HelmRepository","HelmRepository":{"name":"traefik","namespace":"traefik"},"namespace":"traefik","name":"traefik","reconcileID":"624ed1cc-fee8-4ee6-a72c-0919386c3128"} │
│ {"level":"info","ts":"2023-08-31T13:01:04.275Z","msg":"artifact up-to-date with remote revision: 'sha256:b61b15d27295678e1f0f55ce2252e59d0d5eefcfd9378438987f0c7d22b8874b'","controller":"helmrepository","controllerGroup":"source.tool │
│ kit.fluxcd.io","controllerKind":"HelmRepository","HelmRepository":{"name":"termination-handler","namespace":"kube-system"},"namespace":"kube-system","name":"termination-handler","reconcileID":"3eb73cd9-64c8-449c-880c-efa43f804ac5"} │
│ {"level":"info","ts":"2023-08-31T13:01:04.367Z","msg":"artifact up-to-date with remote revision: 'sha256:fc2366fb61e7d6a0e3885ad3a9dbb4603483bab7fb7be193d7a19a32531ab33b'","controller":"helmrepository","controllerGroup":"source.tool │
│ kit.fluxcd.io","controllerKind":"HelmRepository","HelmRepository":{"name":"keda","namespace":"keda"},"namespace":"keda","name":"keda","reconcileID":"a680a92c-8fce-40d8-b1bb-a00b2ea3f93b"} │
│ {"level":"info","ts":"2023-08-31T13:01:04.462Z","msg":"artifact up-to-date with remote revision: 'sha256:fc8b77772f25bd9ee80e623ca9dae6d75d9e12d7a0881f8e6569698c77e39c95'","controller":"helmrepository","controllerGroup":"source.tool │
│ kit.fluxcd.io","controllerKind":"HelmRepository","HelmRepository":{"name":"arvato","namespace":"flux-system"},"namespace":"flux-system","name":"arvato","reconcileID":"76ad94ec-061b-4a76-930f-7622eaea47cc"} │
│ {"level":"info","ts":"2023-08-31T13:01:04.471Z","msg":"artifact up-to-date with remote revision: 'sha256:6a6d60d7687d169ef6a98773250ade991b2666bd2b139b35585031a76ec54a1d'","controller":"helmrepository","controllerGroup":"source.tool │
│ kit.fluxcd.io","controllerKind":"HelmRepository","HelmRepository":{"name":"aws-ebs-csi-snapshot-controller","namespace":"kube-system"},"namespace":"kube-system","name":"aws-ebs-csi-snapshot-controller","reconcileID":"388a54da-bd5e-4 │
│ b3b-9c71-f9ea09c42e2a"} │
│ {"level":"info","ts":"2023-08-31T13:01:04.573Z","msg":"artifact up-to-date with remote revision: 'sha256:bcf4a99bdd95c734e79f335eace5c60ae98499379b87a4e91c4eab5c417f3edb'","controller":"helmrepository","controllerGroup":"source.tool │
│ kit.fluxcd.io","controllerKind":"HelmRepository","HelmRepository":{"name":"descheduler","namespace":"kube-system"},"namespace":"kube-system","name":"descheduler","reconcileID":"da94742b-c190-48f0-a20e-b8228c611477"} │
│ {"level":"info","ts":"2023-08-31T13:01:04.662Z","msg":"artifact up-to-date with remote revision: 'sha256:30ff2c24cdf104e8a8c1d4cabd29d67b1be26ff7fa30ee7c81fd697cd5f6c0f8'","controller":"helmrepository","controllerGroup":"source.tool │
│ kit.fluxcd.io","controllerKind":"HelmRepository","HelmRepository":{"name":"cert-manager","namespace":"cert-manager"},"namespace":"cert-manager","name":"cert-manager","reconcileID":"901feeed-d2fc-4b87-af8f-741cfc422371"} │
│ {"level":"info","ts":"2023-08-31T13:01:04.674Z","msg":"artifact up-to-date with remote revision: 'sha256:b61b15d27295678e1f0f55ce2252e59d0d5eefcfd9378438987f0c7d22b8874b'","controller":"helmrepository","controllerGroup":"source.tool │
│ kit.fluxcd.io","controllerKind":"HelmRepository","HelmRepository":{"name":"aws-alb-ingress-controller","namespace":"kube-system"},"namespace":"kube-system","name":"aws-alb-ingress-controller","reconcileID":"aea57543-95d1-4b4e-a4cf-7 │
│ 9020320e863"} │
│ {"level":"info","ts":"2023-08-31T13:01:04.865Z","msg":"artifact up-to-date with remote revision: 'sha256:c019a926b73fac9181437efdf22a3dd42c58192be83c6731ae3ed578661b51bb'","controller":"helmrepository","controllerGroup":"source.tool │
│ kit.fluxcd.io","controllerKind":"HelmRepository","HelmRepository":{"name":"vertical-pod-autoscaler","namespace":"kube-system"},"namespace":"kube-system","name":"vertical-pod-autoscaler","reconcileID":"319d398c-2e9b-4e1d-bfb6-eaca77b │
│ 1a448"} │
│ {"level":"error","ts":"2023-08-31T13:01:05.263Z","msg":"failed to patch","controller":"helmrepository","controllerGroup":"source.toolkit.fluxcd.io","controllerKind":"HelmRepository","HelmRepository":{"name":"opencost","namespace":"o │
│ pencost"},"namespace":"opencost","name":"opencost","reconcileID":"953b434f-2f02-4050-9b21-4e1ad7311270","error":"context canceled","errorCauses":[{"error":"context canceled"},{"error":"context canceled"}]} │
│ {"level":"error","ts":"2023-08-31T13:01:05.265Z","msg":"Reconciler error","controller":"helmrepository","controllerGroup":"source.toolkit.fluxcd.io","controllerKind":"HelmRepository","HelmRepository":{"name":"opencost","namespace":" │
│ opencost"},"namespace":"opencost","name":"opencost","reconcileID":"953b434f-2f02-4050-9b21-4e1ad7311270","error":"context canceled","errorCauses":[{"error":"context canceled","errorCauses":[{"error":"context canceled"},{"error":"con │
│ text canceled"}]},{"error":"context canceled","errorCauses":[{"error":"context canceled"},{"error":"context canceled"}]}]} │
│ {"level":"error","ts":"2023-08-31T13:01:05.664Z","msg":"failed to patch","controller":"helmrepository","controllerGroup":"source.toolkit.fluxcd.io","controllerKind":"HelmRepository","HelmRepository":{"name":"node-problem-detector"," │
│ namespace":"kube-system"},"namespace":"kube-system","name":"node-problem-detector","reconcileID":"c5d6e0d6-b837-421a-87ae-3472b39a0e58","error":"context canceled","errorCauses":[{"error":"context canceled"},{"error":"context cancele │
│ d"}]} │
│ {"level":"error","ts":"2023-08-31T13:01:05.666Z","msg":"Reconciler error","controller":"helmrepository","controllerGroup":"source.toolkit.fluxcd.io","controllerKind":"HelmRepository","HelmRepository":{"name":"node-problem-detector", │
│ "namespace":"kube-system"},"namespace":"kube-system","name":"node-problem-detector","reconcileID":"c5d6e0d6-b837-421a-87ae-3472b39a0e58","error":"context canceled","errorCauses":[{"error":"context canceled","errorCauses":[{"error":" │
│ context canceled"},{"error":"context canceled"}]},{"error":"context canceled","errorCauses":[{"error":"context canceled"},{"error":"context canceled"}]}]} │
│ {"level":"info","ts":"2023-08-31T13:01:05.808Z","msg":"artifact up-to-date with remote revision: 'sha256:6c904d7c78ccddcd015974879b2ac15e2de1154a72df2e77efce445a9994bde1'","controller":"helmrepository","controllerGroup":"source.tool │
│ kit.fluxcd.io","controllerKind":"HelmRepository","HelmRepository":{"name":"reloader","namespace":"stakater-reloader"},"namespace":"stakater-reloader","name":"reloader","reconcileID":"d3afe587-5b66-4855-97c7-e9dbc45a3876"} │
│ {"level":"error","ts":"2023-08-31T13:01:06.174Z","msg":"failed to patch","controller":"helmrepository","controllerGroup":"source.toolkit.fluxcd.io","controllerKind":"HelmRepository","HelmRepository":{"name":"trivy-server","namespace │
│ ":"security-system"},"namespace":"security-system","name":"trivy-server","reconcileID":"a4bc5e72-c4be-4229-b251-b978b1714662","error":"context canceled","errorCauses":[{"error":"context canceled"},{"error":"context canceled"}]} │
│ {"level":"error","ts":"2023-08-31T13:01:06.177Z","msg":"Reconciler error","controller":"helmrepository","controllerGroup":"source.toolkit.fluxcd.io","controllerKind":"HelmRepository","HelmRepository":{"name":"trivy-server","namespac │
│ e":"security-system"},"namespace":"security-system","name":"trivy-server","reconcileID":"a4bc5e72-c4be-4229-b251-b978b1714662","error":"context canceled","errorCauses":[{"error":"context canceled","errorCauses":[{"error":"context ca │
│ nceled"},{"error":"context canceled"}]},{"error":"context canceled","errorCauses":[{"error":"context canceled"},{"error":"context canceled"}]}]} │
│ {"level":"error","ts":"2023-08-31T13:01:07.268Z","msg":"failed to patch","controller":"helmrepository","controllerGroup":"source.toolkit.fluxcd.io","controllerKind":"HelmRepository","HelmRepository":{"name":"deliveryhero","namespace │
│ ":"flux-system"},"namespace":"flux-system","name":"deliveryhero","reconcileID":"9e5e1fcc-20f7-4092-9079-497218991dc2","error":"context canceled","errorCauses":[{"error":"context canceled"},{"error":"context canceled"}]} │
│ {"level":"error","ts":"2023-08-31T13:01:07.273Z","msg":"Reconciler error","controller":"helmrepository","controllerGroup":"source.toolkit.fluxcd.io","controllerKind":"HelmRepository","HelmRepository":{"name":"deliveryhero","namespac │
│ e":"flux-system"},"namespace":"flux-system","name":"deliveryhero","reconcileID":"9e5e1fcc-20f7-4092-9079-497218991dc2","error":"context canceled","errorCauses":[{"error":"context canceled","errorCauses":[{"error":"context canceled"} │
│ ,{"error":"context canceled"}]},{"error":"context canceled","errorCauses":[{"error":"context canceled"},{"error":"context canceled"}]}]} │
│ {"level":"info","ts":"2023-08-31T13:01:10.363Z","msg":"artifact up-to-date with remote revision: 'sha256:a97a6db135ad1913280ad0b2c6e6b205eddcf9e24ce2096b1385a81a30c281b6'","controller":"helmrepository","controllerGroup":"source.tool │
│ kit.fluxcd.io","controllerKind":"HelmRepository","HelmRepository":{"name":"stable","namespace":"flux-system"},"namespace":"flux-system","name":"stable","reconcileID":"9468eb52-945c-448f-b064-8dd46f9a5f32"} │
│ {"level":"info","ts":"2023-08-31T13:01:10.572Z","logger":"runtime","msg":"successfully acquired lease flux-system/source-controller-leader-election\n"} │
│ {"level":"info","ts":"2023-08-31T13:01:10.573Z","logger":"setup","msg":"starting file server"} │
│ Stream closed EOF for flux-system/source-controller-f46df59c-gbjnb (manager)
So context canceled
tells me that the AWS CNI blocks the access to Kubernetes API. IMO this is a bug in the CNI policy as we have an allow all egress that works with Calico, Cilium, Weave Net, etc. The policy is here: https://github.com/fluxcd/flux2/blob/main/manifests/policies/allow-egress.yaml
I have opened a support case in the affected AWS account. I will update this issue here.
AWS Support was able to reproduce this issue. The Calico setup is working. I'm waiting for further information.
Ran into this exact issue. Were you able to get any further info from AWS? @albertschwarzkopf
@teshsharma AWS is still working on this
We are seeing a similar failure mode. SourceController comes up in EKS, but KustomizationController fails with these logs:
{"level":"info","ts":"2023-09-13T10:11:51.116Z","logger":"controller-runtime.metrics","msg":"Metrics server is starting to listen","addr":":8080"}
{"level":"info","ts":"2023-09-13T10:11:51.117Z","logger":"setup","msg":"starting manager"}
{"level":"info","ts":"2023-09-13T10:11:51.117Z","msg":"Starting server","kind":"health probe","addr":"[::]:9440"}
{"level":"info","ts":"2023-09-13T10:11:51.218Z","msg":"starting server","path":"/metrics","kind":"metrics","addr":"[::]:8080"}
{"level":"info","ts":"2023-09-13T10:11:51.218Z","logger":"runtime","msg":"attempting to acquire leader lease flux2/kustomize-controller-leader-election...\n"}
{"level":"info","ts":"2023-09-13T10:12:16.627Z","msg":"Stopping and waiting for non leader election runnables"}
{"level":"info","ts":"2023-09-13T10:12:16.627Z","msg":"shutting down server","path":"/metrics","kind":"metrics","addr":"[::]:8080"}
{"level":"info","ts":"2023-09-13T10:12:16.627Z","msg":"Stopping and waiting for leader election runnables"}
{"level":"info","ts":"2023-09-13T10:12:16.628Z","msg":"Starting EventSource","controller":"kustomization","controllerGroup":"kustomize.toolkit.fluxcd.io","controllerKind":"Kustomization","source":"kind source: *v1.Kustomization"}
{"level":"info","ts":"2023-09-13T10:12:16.628Z","msg":"Starting EventSource","controller":"kustomization","controllerGroup":"kustomize.toolkit.fluxcd.io","controllerKind":"Kustomization","source":"kind source: *v1beta2.OCIRepository"}
{"level":"info","ts":"2023-09-13T10:12:16.628Z","msg":"Starting EventSource","controller":"kustomization","controllerGroup":"kustomize.toolkit.fluxcd.io","controllerKind":"Kustomization","source":"kind source: *v1.GitRepository"}
{"level":"info","ts":"2023-09-13T10:12:16.628Z","msg":"Starting EventSource","controller":"kustomization","controllerGroup":"kustomize.toolkit.fluxcd.io","controllerKind":"Kustomization","source":"kind source: *v1beta2.Bucket"}
{"level":"info","ts":"2023-09-13T10:12:16.628Z","msg":"Starting Controller","controller":"kustomization","controllerGroup":"kustomize.toolkit.fluxcd.io","controllerKind":"Kustomization"}
{"level":"error","ts":"2023-09-13T10:12:16.628Z","msg":"Could not wait for Cache to sync","controller":"kustomization","controllerGroup":"kustomize.toolkit.fluxcd.io","controllerKind":"Kustomization","error":"failed to wait for kustomization caches to sync: context canceled"}
{"level":"info","ts":"2023-09-13T10:12:16.628Z","msg":"Stopping and waiting for caches"}
{"level":"error","ts":"2023-09-13T10:12:16.628Z","msg":"error received after stop sequence was engaged","error":"failed to wait for kustomization caches to sync: context canceled"}
{"level":"info","ts":"2023-09-13T10:12:16.628Z","msg":"Stopping and waiting for webhooks"}
{"level":"info","ts":"2023-09-13T10:12:16.628Z","msg":"Wait completed, proceeding to shutdown the manager"}
As a side note, it looks like the KustomizationController and AWS EKS Node Agent can conflict on the port used for metrics (8080). Note that the latest AWS VPC CNI release: https://github.com/aws/amazon-vpc-cni-k8s/releases/tag/v1.14.1 changed the default metrics and health probe ports to avoid this conflict
we also have this problem running aws vpc cni 1.14.1
@jdn5126 I don't think the port conflict there really matters. You'd only run into conflict if both node-agent and kustomize-controller run on hostNetwork: true
mode. But while node-agent does do so, there's no reason for kustomize-controller to run on the hostNetwork.
we managed to get the pods to stop crashing by setting the ingress rule to allow traffic from 0.0.0.0/0 or {} but yeah not ideal
https://github.com/aws/amazon-vpc-cni-k8s/issues/2571
So let us wait for the next release.
Hi,
Amazon VPC CNI now supports Kubernetes Network Policies. I have enabled this in our DEV cluster, but the helm-controller, kustomize-controller and source-controller are crashing then (CrashLoopBackOff). Notification-controller is running.
We are using EKS 1.27 and Flux v2.0.1.
No other Networkpolicies are deployed yet. Which additional permits needs Flux else? E.g. Api-Server, ...
Steps to reproduce
Create AWS EKS 1.27 Cluster (with Bottlerocket OS 1.14.3) and enable Networkpolicies in the VPC CNI Add-On. Deploy Flux.
Expected behavior
Flux should not crashes with default enabled Networkpolicies.
OS / Distro
Bottlerocket OS 1.14.3 (aws-k8s-1.27)
Flux version
v2.0.1
Flux check
N/A
Code of Conduct