oracle-terraform-modules / terraform-oci-oke

The Terraform OKE Module Installer for Oracle Cloud Infrastructure provides a Terraform module that provisions the necessary resources for Oracle Container Engine.
https://oracle-terraform-modules.github.io/terraform-oci-oke/
Universal Permissive License v1.0
157 stars 213 forks source link

Cluster Autoscaler installation fail #919

Open shunsukew opened 6 months ago

shunsukew commented 6 months ago

Community Note

Terraform Version and Provider Version

Terraform v1.6.6
on darwin_arm64
+ provider registry.terraform.io/hashicorp/cloudinit v2.3.3
+ provider registry.terraform.io/hashicorp/helm v2.12.1
+ provider registry.terraform.io/hashicorp/http v3.4.2
+ provider registry.terraform.io/hashicorp/null v3.2.2
+ provider registry.terraform.io/hashicorp/oci v5.33.0
+ provider registry.terraform.io/hashicorp/random v3.6.0
+ provider registry.terraform.io/hashicorp/time v0.10.0
+ provider registry.terraform.io/oracle/oci v5.30.0

Your version of Terraform is out of date! The latest version
is 1.8.2. You can update by downloading from https://www.terraform.io/downloads.html

Affected Resource(s)

module.{module_name}.module.extensions[0].null_resource.cluster_autoscaler[0]

Terraform Configuration Files

  // Autoscaler
  // https://oracle-terraform-modules.github.io/terraform-oci-oke/guide/extensions_cluster_autoscaler.html
  cluster_autoscaler_install           = true
  cluster_autoscaler_namespace         = "kube-system"
  cluster_autoscaler_helm_version      = "9.36.0"
  cluster_autoscaler_helm_values       = {}
  cluster_autoscaler_helm_values_files = []

  worker_pools = {
    oke-vm-autoscaler = {
      size = 1, // Do not scale out. Scale out below worker pool such as oke-vm-optimized.
      description      = "Node pool with cluster autoscaler scheduling allowed",
      allow_autoscaler = true,
      shape = "VM.Standard3.Flex"
      ocpus = 1,
      memory = 4,
      boot_volume_size: 50,
    },
    ...
}

Expected Behavior

Autoscaler is properly installed.

Actual Behavior

Installation fails due to remote-exec error. error message is here. Error message says, It looks like you are trying to use a client-go credential plugin that is not installed. which should be installed on the node.

Steps to Reproduce

Create OKE cluster with version 5.1.5, and set autoscaler config.

hyder commented 6 months ago

hi, Thanks for logging this issue. In order for autoscaler installation to work, kubectl needs to be working too and it depends on oci-cli for authentication. The logs show that oci cli was not installed yet.

Can you please check on the operator host and confirm if that's the case?

shunsukew commented 6 months ago

Hello, Yes, oci command is available on the operator host.

ubuntu@o-dev-japaneast-oke:~$ oci --version
3.40.1

Also, kubectl is available on the operator host. However, oci-cli was installed manually because it was not installed by default when provisioning by this oke module.