Allow running CRD conversion webhook server without controllers

Oats87 commented 10 months ago

What would you like to be added (User Story)?

As an operator, I would like to be able to manipulate cluster-api objects in certain situations without having them immediately reconciled by the capi-controller-manager

Detailed Description

Today, the conversion webhooks are nested with the capi-controller-manager. This proves problematic in scenarios where the management cluster is being restored from an object-level backup, using a tool like https://github.com/rancher/backup-restore-operator or https://velero.io/ as the conversion webhook on the CRD will fail if the webhook endpoint is not available.

During these restoration operations, it is desirable to be able to run the conversion webhook server while "paving" the objects into place, but not having immediate reconciliation actions take place on them until the restoration is complete. I am thinking for example, the case where a MachineSet has child Machines and if the MachineSet object is restored while controllers are running, it could spawn new Machine objects that have not yet been restored.

There are a few workarounds to this, namely:

Annotate all cluster-api objects with cluster.x-k8s.io/paused=true
Set a bogus watch filter value
Set the concurrency for all of the controllers to 0 (not tested but theoretically this works?)

but all of them are less than ideal solutions for a "robust" DR strategy.

Anything else you would like to add?

No response

Label(s) to be applied

/kind feature /area provider/core

k8s-ci-robot commented 10 months ago

This issue is currently awaiting triage.

If CAPI contributors determines this is a relevant issue, they will accept it by applying the triage/accepted label and provide further guidance.

The triage/accepted label can be added by org members by writing /triage accepted in a comment.

Instructions for interacting with me using PR comments are available [here](https://git.k8s.io/community/contributors/guide/pull-requests.md). If you have questions or suggestions related to my behavior, please file an issue against the [kubernetes/test-infra](https://github.com/kubernetes/test-infra/issues/new?title=Prow%20issue:) repository.

killianmuldoon commented 9 months ago

I think the current recommended way to do this is to use the paused annotation - what are the limitations of this that make it a less than ideal solution?

mdbooth commented 9 months ago

I think the current recommended way to do this is to use the paused annotation - what are the limitations of this that make it a less than ideal solution?

I'm assuming the requirement to set the paused annotation on all objects rather than, e.g. just the cluster object?

@Oats87 You say:

it is desirable to be able to run the conversion webhook server while "paving" the objects into place,

Can you expand on that?

Oats87 commented 9 months ago

I think the current recommended way to do this is to use the paused annotation - what are the limitations of this that make it a less than ideal solution?

While using the paused annotation is certainly a solution, "in the trenches" it can be pretty difficult to guarantee the paused annotation is set on the object during restoration.

There are a few cases where a user may be trying to pave objects back into place, specifically off the top of my head I can think of

Restoring a cluster that was "backed up" by using YAML
Using an object-level backup/restoration tool such as Velero or https://github.com/rancher/backup-restore-operator/

I believe Velero has a method to inject json patches to the objects being restored which would allow for adding the annotation, but I think you'd have to manually unpause all objects after performing the restoration.

@Oats87 You say:

it is desirable to be able to run the conversion webhook server while "paving" the objects into place,

Can you expand on that?

Sure. In my specific case we need to be able to perform an object level restoration (etcd restoration is not possible in managed Kubernetes providers such as EKS/AKS/GKE in our case) which mandates the need to restore objects into place. I say "pave" because we're taking a fresh cluster and restoring i.e. paving objects into it. If the object apiVersion is at the "pivot" version of the CRD i.e. the stored version, the conversion webhook is not necessary, if the object being restored requires conversion the conversion webhook server must be running in order to perform the restoration.

sbueringer commented 9 months ago

What happens if you run the CAPI controllers with 0 controller replicas?

mdbooth commented 9 months ago

What happens if you run the CAPI controllers with 0 controller replicas?

Then you wouldn't be running the conversion webhooks either because they're both under the same manager.

sbueringer commented 9 months ago

Oh sorry. What I meant was to run the CAPI controllers with 0 concurrency (you still run a Pod but without any controller workers)

Oats87 commented 9 months ago

Oh sorry. What I meant was to run the CAPI controllers with 0 concurrency (you still run a Pod but without any controller workers)

Testing v1.4.4 I see that it defaults count=1 if --<controller>-concurrency=0: i.e.

<snip>
    spec:
      containers:
      - args:
        - --leader-elect
        - --metrics-bind-addr=localhost:8080
        - --feature-gates=MachinePool=false,ClusterResourceSet=false,ClusterTopology=false,RuntimeSDK=false,LazyRestmapper=false
        - --cluster-concurrency=0
        command:
        - /manager
        env:
<snip>

results in:

I0912 17:12:11.918417       1 controller.go:228] "Starting workers" controller="cluster" controllerGroup="cluster.x-k8s.io" controllerKind="Cluster" worker count=1

sbueringer commented 7 months ago

So best workaround as of now is probably to set the watch-filter value to some selector that doesn't match anything (like you listed above). Which should be a supported use case of that flag based on it's description:

fs.StringVar(&watchFilterValue, "watch-filter", "",
        fmt.Sprintf("Label value that the controller watches to reconcile cluster-api objects. Label key is always %s. If unspecified, the controller watches for all cluster-api objects.", clusterv1.WatchLabel))

Basically, "If you want to reconcile nothing, set the filter to match nothing"

k8s-triage-robot commented 4 months ago

The Kubernetes project currently lacks enough contributors to adequately respond to all issues.

This bot triages un-triaged issues according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Mark this issue as fresh with /remove-lifecycle stale
Close this issue with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

k8s-triage-robot commented 3 months ago

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues.

This bot triages un-triaged issues according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Mark this issue as fresh with /remove-lifecycle rotten
Close this issue with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle rotten

fabriziopandini commented 3 months ago

/lifecycle frozen /priority backlog /triage accepted

Looking for ways to improve how to recovery from disasters is always valuable. However, I'm debated if a long-term solution for this issue should be implemented in controller runtime - thus standardizing this across all the controllers - or if it should be implemented in CAP* by skipping setupReconcilers.

kubernetes-sigs / cluster-api