digitalocean / terraform-provider-digitalocean

Terraform DigitalOcean provider
https://registry.terraform.io/providers/digitalocean/digitalocean/latest/docs
Mozilla Public License 2.0

Invalid kubeconfig token not being renewed #449

Open deanrock opened 4 years ago

deanrock commented 4 years ago

I created a DO cluster 10 days ago. Yesterday, terraform plan couldn't refresh the state because the K8s API started returning Unauthorized.

expires_at in the existing state was set to June 16th (everything worked without issues earlier yesterday), so renewal was not triggered here: https://github.com/terraform-providers/terraform-provider-digitalocean/blob/fd9e7b8b8156599799c7b2e636f68a529929b3bf/digitalocean/resource_digitalocean_kubernetes_cluster.go#L294

I'm fairly certain I didn't manually delete the DOKS API token, so I'm not entirely sure why it stopped working.

Terraform Version

Terraform v0.12.26
+ provider.archive v1.3.0
+ provider.digitalocean v1.19.0
+ provider.helm v1.2.2
+ provider.kubernetes v1.11.3

Affected Resource(s)

digitalocean_kubernetes_cluster

Terraform Configuration Files

resource "digitalocean_kubernetes_cluster" "cluster" {
  name    = var.prefix
  region  = var.region
  version = "1.17.5-do.0"
  tags    = []

  node_pool {
    name       = "${var.prefix}-worker"
    size       = "s-1vcpu-2gb"
    auto_scale = true
    min_nodes  = 1
    max_nodes  = 5
  }
}

provider "kubernetes" {
  load_config_file = false
  host             = digitalocean_kubernetes_cluster.cluster.endpoint
  token            = digitalocean_kubernetes_cluster.cluster.kube_config[0].token
  cluster_ca_certificate = base64decode(
    digitalocean_kubernetes_cluster.cluster.kube_config[0].cluster_ca_certificate
  )
}

resource "kubernetes_namespace" "ingress" {
  metadata {
    name = "ingress"
  }
}

Expected Behavior

The provider could check whether the kube_config token is still valid, and force a renewal if it is not.

Actual Behavior

terraform plan fails with Error: Unauthorized.

Important Factoids

Removing the cluster from state and importing it again solves the problem.

nrmitchi commented 3 years ago

@deanrock I'm not 100% sure, but this looks very similar to a problem I deal with every week; you could try a terraform refresh rather than removing the resource from state and re-importing it.

dmikalova commented 2 years ago

This also happens to me on a weekly basis. Running plan and apply on the cluster also fixes the issue for me. Note that the kube_config generated by doctl kubernetes cluster kubeconfig save "$CLUSTER_ID" does not have this issue.
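
The doctl-generated kubeconfig likely avoids the stale-token problem because it authenticates through an exec credential plugin rather than a static token stored in state. Below is a minimal sketch of the same idea applied to the provider block from the original report, assuming doctl is installed and authenticated where Terraform runs and that the kubernetes provider version in use supports the exec block:

# Sketch only: relies on doctl being available on the machine running Terraform.
provider "kubernetes" {
  load_config_file = false
  host             = digitalocean_kubernetes_cluster.cluster.endpoint
  cluster_ca_certificate = base64decode(
    digitalocean_kubernetes_cluster.cluster.kube_config[0].cluster_ca_certificate
  )

  # Fetch a fresh credential on every run instead of reading the token from state.
  exec {
    api_version = "client.authentication.k8s.io/v1beta1"
    command     = "doctl"
    args = [
      "kubernetes", "cluster", "kubeconfig", "exec-credential",
      "--version=v1beta1", digitalocean_kubernetes_cluster.cluster.id,
    ]
  }
}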

atmosx commented 11 months ago

We've hit the same issue today.

andrewsomething commented 11 months ago

Unfortunately, we cannot recommend using interpolation to pass credentials from a digitalocean_kubernetes_cluster resource to the Kubernetes provider. Generally, the cluster resource should not be created in the same Terraform module where Kubernetes provider resources are also used. There are warnings against this approach in the docs for both the DigitalOcean and Kubernetes providers.

https://registry.terraform.io/providers/digitalocean/digitalocean/latest/docs/resources/kubernetes_cluster#kubernetes-terraform-provider-example

https://registry.terraform.io/providers/hashicorp/kubernetes/latest/docs#stacking-with-managed-kubernetes-cluster-resources

The most reliable way to configure the Kubernetes provider is to ensure that the cluster itself and the Kubernetes provider resources are managed in separate apply operations. Data sources can be used to convey values between the two stages as needed.
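
For example, a second configuration that manages only the Kubernetes resources can read the cluster through a data source. A minimal sketch, assuming the cluster was created in a separate configuration and that var.cluster_name is defined by the reader:

data "digitalocean_kubernetes_cluster" "cluster" {
  name = var.cluster_name
}

provider "kubernetes" {
  load_config_file = false
  host             = data.digitalocean_kubernetes_cluster.cluster.endpoint
  token            = data.digitalocean_kubernetes_cluster.cluster.kube_config[0].token
  cluster_ca_certificate = base64decode(
    data.digitalocean_kubernetes_cluster.cluster.kube_config[0].cluster_ca_certificate
  )
}

Because the data source is read during each plan of the second configuration, a current token is fetched on every run rather than the one persisted in state.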

The root issue lies with the order in which Terraform itself evaluates the provider blocks vs. actual resources.

atmosx commented 11 months ago

In case anyone finds this helpful: in our case the problem was configuration drift between the cluster and our Terraform state. It is strange and I can't be sure how it happened, but it did. Once we fixed the drift by removing the resources that were erroneously part of the config, everything worked smoothly and the token was updated automatically.