weaveworks / sa-demos

This repository is dealing with the Weave GitOps demos and demo environments. Please use it to file and update issues that you are seeing. You can also put your demo guides and demo scripts here.
1 stars 0 forks source link

CAPI stability issue. One error in a template, or the provisioning of any cluster stops all capi related work. #28

Closed darrylweaver closed 1 year ago

darrylweaver commented 2 years ago

In the DemoEnvs, we have seen that specially during configuration time of a new provider, there are errors popping up when deploying clusters. You need to go through the motions and hit all the obstacles and configure them out of the way. Each of these errors will halt the flux-system kustomization.

Only command line investigation and manual intervention will bring the flux-system kustomization and thus the capi deployment system back into operation.

This makes CAPI and cluster provisioning brittle. Please discuss and suggest ideas on how to improve this.

One idea might be to run a kustomization for each type of provider. This would possibly contain errors to the provider in question. Liquid Metal would only impact LM provisioning, AKS only Azure and so on.

LutzLange commented 2 years ago

Putting different provider cluster definitions into different namespaces might allow for this separation.

LutzLange commented 2 years ago

It is not so much different namespaces as different kustomizations for different providers that would help here.

darrylweaver commented 1 year ago

keeping this issue to raise with product at a later date