c0c0n3 / kitt4sme.live

On a mission to bring AI to the shop floor: https://kitt4sme.eu/
MIT License
1 stars 28 forks source link

Issues creating a local cluster #339

Closed beltramoeinventio closed 1 year ago

beltramoeinventio commented 1 year ago

Describe the bug

We tried to install a local cluster on Ubuntu server 20.04 following the suggested guide, but many services looks not run correctly

To Reproduce

Steps to reproduce the behavior: follow this guide ('Building your own cluster'): https://github.com/c0c0n3/kitt4sme.live/blob/main/docs/bootstrap.md

after all step done, we verified the status of pods (kubectl get pod --all-namespaces) but many service look have some issues (either after waiting an hour to let system starting): kubectl get pod --all-namespaces NAMESPACE NAME READY STATUS RESTARTS AGE default mosquitto-7c756d68b7-ktdds 0/2 Init:0/1 0 14h argocd argocd-redis-6d8cffcc47-zzx75 1/1 Running 1 15h argocd argocd-applicationset-controller-c5c445f85-tcj8f 1/1 Running 1 15h argocd argocd-dex-server-6ccdfffd99-6n4wt 1/1 Running 1 15h argocd argocd-notifications-controller-7954d65d56-h4j49 1/1 Running 1 15h istio-system istio-egressgateway-756d4db566-j5wg4 1/1 Running 1 15h default sads-7f6cc4f8fd-kcqnl 2/2 Running 2 15h istio-system prometheus-77b49cb997-m55mp 2/2 Running 2 14h kube-system calico-kube-controllers-f7868dd95-rw2sq 1/1 Running 2 15h default opa-84db9f64db-zzj9g 2/2 Running 3 14h kube-system reloader-55885ccf44-lcxjb 1/1 Running 1 14h default roughnator-7569f94698-5rgd9 2/2 Running 2 14h kube-system calico-node-px8rb 1/1 Running 2 15h kube-system coredns-7f9c69c78c-28k66 1/1 Running 2 15h default mongodb-6dd4bf78d9-h57n6 2/2 Running 3 14h default datasheet-frontend-56b8f97d45-9fdg2 2/2 Running 2 14h default viqe-54b787b957-frsdf 2/2 Running 2 15h kube-system sealed-secrets-controller-7684c7b86c-7vgzg 1/1 Running 2 14h default grafana-5c5ccc98bd-q9bbh 2/2 Running 2 15h default ulagent-7c894c4458-mznmq 2/2 Running 2 14h istio-system istiod-5847c59c69-wmx4x 1/1 Running 1 15h istio-system istio-ingressgateway-8577c57fb6-vvxdp 1/1 Running 1 15h argocd argocd-application-controller-0 1/1 Running 1 15h default jsonagent-699968f689-vjpbt 2/2 Running 2 15h istio-system grafana-68cc7d6d78-6ktx9 1/1 Running 1 15h istio-system jaeger-5d44bc5c5d-f564g 1/1 Running 1 14h default crate-0 2/2 Running 2 14h argocd argocd-repo-server-59f47d7dfc-8rtjv 1/1 Running 1 15h default datasheet-backend-689d789747-c454l 1/2 CreateContainerConfigError 1 14h argocd argocd-server-85cfffc556-vztsk 1/1 Running 1 15h default sads-offline-6848df7d89-g6swb 2/2 Running 3 15h default profilers-55c9bd4bbd-lgx6z 0/2 Init:CreateContainerConfigError 0 14h default orion-586c7b7499-bp8dk 2/2 Running 6 15h default dazzler-5949858c56-5zrzw 2/2 Running 3 15h kube-system hostpath-provisioner-566686b959-bs8qq 1/1 Running 18 15h default static-page-5b9c4c4957-ck4sg 2/2 Running 3 14h default matchmaking-8b448d67d-2ccnd 1/2 CreateContainerConfigError 1 14h default keycloak-5c687cfcb5-nbxsp 1/2 CreateContainerConfigError 1 14h default postgres-74487cff86-8b65t 1/2 CreateContainerConfigError 1 14h default aq-7944fd7c46-t2lbl 0/2 Init:CreateContainerConfigError 0 14h default quantumleap-757dbd48d8-smxb6 0/2 Init:CreateContainerConfigError 0 14h istio-system kiali-fd9f88575-nnlqc 1/1 Running 1 14h default platform-configurator-85df8cb9dc-8wjnj 1/2 ImagePullBackOff 1 14h default fams-5c87b55d58-x2j6t 1/2 ErrImagePull 1 14h default intervention-manager-b44d5697d-tprf4 1/5 ImagePullBackOff 1 14h

it looks to have different errors. ie.e of services datasheet-backend-689d789747-c454l or matchmaking-8b448d67d-2ccnd (CreateContainerConfigError) in the log we have same error of missing user: Error: secret "postgres-users" not found

for service mosquitto-7c756d68b7-ktdds (always in init state): MountVolume.SetUp failed for volume "mosquitto-passwd" : secret "mosquitto-passwd" not found

ie.e of services platform-configurator-85df8cb9dc-8wjnj (ImagePullBackOff) in the log we have error to access docker repository: Failed to pull image "gitlab-core.supsi.ch:5050/dti-isteps/spslab/human-robot-interaction/kitt4sme/platform-configurator:0.4.1": rpc error: code = Unknown desc = failed to pull and unpack image "gitlab-core.supsi.ch:5050/dti-isteps/spslab/human-robot-interaction/kitt4sme/platform-configurator:0.4.1": failed to resolve reference "gitlab-core.supsi.ch:5050/dti-isteps/spslab/human-robot-interaction/kitt4sme/platform-configurator:0.4.1": failed to authorize: failed to fetch anonymous token: unexpected status: 403 Forbidden

Expected behavior

A clear and concise description of what you expected to happen. All services (except Keycloak, mentioned explicity to not run) running correctly

Additional context

There are some missing configuration steps that shall be done not mentioned in guide?

beltramoeinventio commented 1 year ago

Fixed using the guide in opencalls branch