nokia / danm

TelCo grade network management in a Kubernetes cluster
BSD 3-Clause "New" or "Revised" License
373 stars 81 forks source link

networkPlugin cni failed to set up pod "danm-installer-mmcqr_kube-system" #243

Closed jdambly-ns closed 3 years ago

jdambly-ns commented 3 years ago

Is this a BUG REPORT or FEATURE REQUEST?: bug

What happened: running the installer job fails to run the pod

  Warning  FailedCreatePodSandBox  2s (x2 over 5s)    kubelet            (combined from similar events): Failed to create pod sandbox: rpc error: code = Unknown desc = failed to set up sandbox container "f704fbd4490652949ee8a6aec4b5506a209599e579d25372e31d768c09d1c349" network for pod "danm-installer-gjhkh": networkPlugin cni failed to set up pod "danm-installer-gjhkh_kube-system" network: Pod manifest could not be parsed with error: failed to get Pod info from K8s API server due to:Get https://10.128.128.1:443/api/v1/namespaces/kube-system/pods/danm-installer-gjhkh: x509: certificate signed by unknown authority

What you expected to happen: installer job runs to completion

How to reproduce it: kubctl apply -f integration/install Anything else we need to know?: I think the issue is in danm-kubeconfig, what creates this kubeconfig?

apiVersion: v1
kind: Config
current-context: default
clusters:
  - cluster:
      certificate-authority-data: <<< this is null/missing

calico is my default cni Environment:

Levovar commented 3 years ago

hi @jdambly-ns! sorry I was on my winter break, and weren't checking my projects that much :) will try to take a look this week and come back to you! after first inspection there might be some issue with the certificate generating and signing scripts

Levovar commented 3 years ago

Looking back at this, I think the installer job's config file is appropriately commented: https://github.com/nokia/danm/blob/master/integration/install/danm-installer-config.yaml#L29

let us know if you were unable to configure correct CA authority cert for the installer based on the instructions