Closed enekofb closed 1 year ago
reproduced locally by not alowing the gitopscluster service account to impersonate collector service account in leaf-custer-1
Error: hydrating query server: cannot start access rule collector: could not start role collector: cannot watch cluster: failed to start watcher for cluster flux-system/leaf-cluster-1: cannot create watcher manager: cannot create controller manager: unknown
Expected behaviour logged error raising the issue app do not panic app starts with degraded explorer experience
The following logic will be implemented
Now:
Future -> in the context of reliability
this error just seen also in demo2
it should provide context
{"level":"error","ts":"2023-05-02T09:15:13.299Z","msg":"cannot start watcher","error":"failed to wait for role caches to sync: timed out waiting for cache to be synced"}
Enabled Staging
https://github.com/weaveworks/weave-gitops-clusters/commit/378f8ccf991ac5094d5a13e96b3de477d96a00b8
And we could see how failures are being logged but
{"level":"error","ts":"2023-05-05T06:52:18.753Z","msg":"Failed to get API Group-Resources","error":"unknown"}
{"level":"error","ts":"2023-05-05T06:52:18.753Z","msg":"cannot watch cluster","cluster":"default/prod","error":"failed to start watcher for cluster default/prod: cannot create watcher manager: cannot create controller manager: unknown"}
{"level":"error","ts":"2023-05-05T06:52:18.763Z","msg":"Failed to get API Group-Resources","error":"unknown"}
{"level":"error","ts":"2023-05-05T06:52:18.763Z","msg":"cannot watch cluster","cluster":"flux-system/dev","error":"failed to start watcher for cluster flux-system/dev: cannot create watcher manager: cannot create controller manager: unknown"}
But the app is up and running
Explorer collector needs RBAC from v0.22.0 to watch leaf clusters
When this RBAC is not present the following messages show up
Which is not great to understand cause, effect, fix
This ticket to
AC
Notes
Scenarios
given leaf cluster connected to wge without collector RBAC when upgraded to 0.22 with explorer enabled
Current behaviour
Expected behaviour