canonical / bundle-kubeflow

Charmed Kubeflow
Apache License 2.0

Finalize and publish upgrade notes 1.8 to 1.9 #988

Closed. orfeas-k closed this issue 1 month ago

orfeas-k commented 1 month ago

Context

Based on internal document https://docs.google.com/document/d/1Wg32O5PF8RMy7ng7hY9gX37lHnwmszyBt4D2lI_MSjQ/edit

What needs to get done

self-explanatory

Definition of Done

The upgrade notes are published under our docs

syncronize-issues-to-jira[bot] commented 1 month ago

Thank you for reporting your feedback to us!

The internal ticket has been created: https://warthogs.atlassian.net/browse/KF-6030.

This message was autogenerated

orfeas-k commented 1 month ago

I performed the upgrade in an Azure one-click deployment environment. This is the final juju status output:

Model     Controller     Cloud/Region       Version  SLA          Timestamp
kubeflow  azure-uksouth  k8s-cloud/uksouth  3.4.4    unsupported  15:57:24Z

SAAS        Status  Store  URL
grafana     active  local  admin/cos.grafana
prometheus  active  local  admin/cos.prometheus

App                        Version                  Status  Scale  Charm                    Channel          Rev  Address       Exposed  Message
admission-webhook                                   active      1  admission-webhook        1.9/stable       344  10.0.170.68   no       
argo-controller                                     active      1  argo-controller          3.4/stable       545  10.0.201.37   no       
dex-auth                                            active      1  dex-auth                 2.39/stable      548  10.0.33.175   no       
envoy                                               active      1  envoy                    2.2/stable       263  10.0.35.138   no       
grafana-agent-k8s          0.40.4                   active      1  grafana-agent-k8s        latest/edge       80  10.0.205.176  no       logging-consumer: off
istio-ingressgateway                                active      1  istio-gateway            1.22/stable     1127  10.0.109.253  no       
istio-pilot                                         active      1  istio-pilot              1.22/stable     1079  10.0.22.16    no       
jupyter-controller                                  active      1  jupyter-controller       1.9/stable      1038  10.0.180.172  no       
jupyter-ui                                          active      1  jupyter-ui               1.9/stable       961  10.0.242.87   no       
katib-controller                                    active      1  katib-controller         0.17/stable      750  10.0.190.187  no       
katib-db                   8.0.36-0ubuntu0.22.04.1  active      1  mysql-k8s                8.0/stable       153  10.0.235.23   no       
katib-db-manager                                    active      1  katib-db-manager         0.17/stable      713  10.0.13.125   no       
katib-ui                                            active      1  katib-ui                 0.17/stable      713  10.0.242.102  no       
kfp-api                                             active      1  kfp-api                  2.2/stable      1552  10.0.242.2    no       
kfp-db                     8.0.36-0ubuntu0.22.04.1  active      1  mysql-k8s                8.0/stable       153  10.0.86.17    no       
kfp-metadata-writer                                 active      1  kfp-metadata-writer      2.2/stable       617  10.0.173.186  no       
kfp-persistence                                     active      1  kfp-persistence          2.2/stable      1560  10.0.236.166  no       
kfp-profile-controller                              active      1  kfp-profile-controller   2.2/stable      1518  10.0.140.193  no       
kfp-schedwf                                         active      1  kfp-schedwf              2.2/stable      1571  10.0.18.189   no       
kfp-ui                                              active      1  kfp-ui                   2.2/stable      1555  10.0.240.137  no       
kfp-viewer                                          active      1  kfp-viewer               2.2/stable      1586  10.0.97.49    no       
kfp-viz                                             active      1  kfp-viz                  2.2/stable      1504  10.0.56.202   no       
knative-eventing                                    active      1  knative-eventing         1.12/stable      459  10.0.140.29   no       
knative-operator                                    active      1  knative-operator         1.12/stable      433  10.0.123.103  no       
knative-serving                                     active      1  knative-serving          1.12/stable      460  10.0.150.33   no       
kserve-controller                                   active      1  kserve-controller        0.13/stable      597  10.0.35.244   no       
kubeflow-dashboard                                  active      1  kubeflow-dashboard       1.9/stable       659  10.0.233.198  no       
kubeflow-profiles                                   active      1  kubeflow-profiles        1.9/stable       419  10.0.87.43    no       
kubeflow-roles                                      active      1  kubeflow-roles           1.9/stable       240  10.0.18.31    no       
kubeflow-volumes                                    active      1  kubeflow-volumes         1.9/stable       348  10.0.86.250   no       
metacontroller-operator                             active      1  metacontroller-operator  3.0/stable       311  10.0.44.178   no       
minio                      res:oci-image@5102166    active      1  minio                    ckf-1.9/stable   347  10.0.154.106  no       
mlflow-mysql               8.0.36-0ubuntu0.22.04.1  active      1  mysql-k8s                8.0/stable       153  10.0.159.238  no       
mlflow-server                                       active      1  mlflow-server            2.1/stable       466  10.0.7.216    no       
mlmd                                                active      1  mlmd                     ckf-1.9/stable   213  10.0.217.192  no       
oidc-gatekeeper                                     active      1  oidc-gatekeeper          ckf-1.9/stable   423  10.0.154.237  no       
pvcviewer-operator                                  active      1  pvcviewer-operator       1.9/stable       157  10.0.100.71   no
resource-dispatcher                                 active      1  resource-dispatcher      latest/edge      175  10.0.151.44   no       
seldon-controller-manager                           active      1  seldon-core              1.17/stable      664  10.0.8.200    no       
tensorboard-controller                              active      1  tensorboard-controller   1.9/stable       333  10.0.155.254  no       
tensorboards-web-app                                active      1  tensorboards-web-app     1.9/stable       321  10.0.228.150  no       
training-operator                                   active      1  training-operator        1.8/stable       503  10.0.105.164  no       

Unit                          Workload  Agent  Address      Ports          Message
admission-webhook/0*          active    idle   10.244.0.39                 
argo-controller/0*            active    idle   10.244.2.38                 
dex-auth/0*                   active    idle   10.244.2.39                 
envoy/0*                      active    idle   10.244.2.36                 
grafana-agent-k8s/0*          active    idle   10.244.2.19                 logging-consumer: off
istio-ingressgateway/0*       active    idle   10.244.0.36                 
istio-pilot/0*                active    idle   10.244.1.38                 
jupyter-controller/0*         active    idle   10.244.2.40                 
jupyter-ui/0*                 active    idle   10.244.2.41                 
katib-controller/0*           active    idle   10.244.2.44                 
katib-db-manager/0*           active    idle   10.244.2.42                 
katib-db/0*                   active    idle   10.244.1.9                  Primary
katib-ui/0*                   active    idle   10.244.0.40                 
kfp-api/0*                    active    idle   10.244.2.43                 
kfp-db/0*                     active    idle   10.244.1.27                 Primary
kfp-metadata-writer/0*        active    idle   10.244.0.41                 
kfp-persistence/0*            active    idle   10.244.1.46                 
kfp-profile-controller/0*     active    idle   10.244.0.44                 
kfp-schedwf/0*                active    idle   10.244.1.45                 
kfp-ui/0*                     active    idle   10.244.0.42                 
kfp-viewer/0*                 active    idle   10.244.0.43                 
kfp-viz/0*                    active    idle   10.244.2.47                 
knative-eventing/0*           active    idle   10.244.2.45                 
knative-operator/0*           active    idle   10.244.2.48                 
knative-serving/0*            active    idle   10.244.1.47                 
kserve-controller/0*          active    idle   10.244.2.52                 
kubeflow-dashboard/0*         active    idle   10.244.0.45                 
kubeflow-profiles/0*          active    idle   10.244.2.50                 
kubeflow-roles/0*             active    idle   10.244.2.46                 
kubeflow-volumes/0*           active    idle   10.244.2.37                 
metacontroller-operator/0*    active    idle   10.244.2.10                 
minio/0*                      active    idle   10.244.0.47  9000-9001/TCP  
mlflow-mysql/0*               active    idle   10.244.0.23                 Primary
mlflow-server/0*              active    idle   10.244.0.9                  
mlmd/0*                       active    idle   10.244.2.35                 
oidc-gatekeeper/0*            active    idle   10.244.0.46                 
pvcviewer-operator/0*         active    idle   10.244.2.51                
resource-dispatcher/0*        active    idle   10.244.0.49                 
seldon-controller-manager/0*  active    idle   10.244.1.4                  
tensorboard-controller/0*     active    idle   10.244.1.49                 
tensorboards-web-app/0*       active    idle   10.244.1.48                 
training-operator/0*          active    idle   10.244.2.49     

Integration provider                           Requirer                                       Interface                 Type     Message
argo-controller:grafana-dashboard              grafana-agent-k8s:grafana-dashboards-consumer  grafana_dashboard         regular  
argo-controller:metrics-endpoint               grafana-agent-k8s:metrics-endpoint             prometheus_scrape         regular  
dex-auth:dex-oidc-config                       oidc-gatekeeper:dex-oidc-config                dex-oidc-config           regular  
dex-auth:metrics-endpoint                      grafana-agent-k8s:metrics-endpoint             prometheus_scrape         regular  
grafana-agent-k8s:grafana-dashboards-provider  grafana:grafana-dashboard                      grafana_dashboard         regular  
grafana-agent-k8s:peers                        grafana-agent-k8s:peers                        grafana_agent_replica     peer     
istio-pilot:gateway-info                       kserve-controller:ingress-gateway              istio-gateway-info        regular  
istio-pilot:gateway-info                       tensorboard-controller:gateway-info            istio-gateway-info        regular  
istio-pilot:ingress                            dex-auth:ingress                               ingress                   regular  
istio-pilot:ingress                            envoy:ingress                                  ingress                   regular  
istio-pilot:ingress                            jupyter-ui:ingress                             ingress                   regular  
istio-pilot:ingress                            katib-ui:ingress                               ingress                   regular  
istio-pilot:ingress                            kfp-ui:ingress                                 ingress                   regular  
istio-pilot:ingress                            kubeflow-dashboard:ingress                     ingress                   regular  
istio-pilot:ingress                            kubeflow-volumes:ingress                       ingress                   regular  
istio-pilot:ingress                            mlflow-server:ingress                          ingress                   regular  
istio-pilot:ingress                            oidc-gatekeeper:ingress                        ingress                   regular  
istio-pilot:ingress                            tensorboards-web-app:ingress                   ingress                   regular  
istio-pilot:ingress-auth                       oidc-gatekeeper:ingress-auth                   ingress-auth              regular  
istio-pilot:istio-pilot                        istio-ingressgateway:istio-pilot               k8s-service               regular  
istio-pilot:peers                              istio-pilot:peers                              istio_pilot_peers         peer     
jupyter-controller:grafana-dashboard           grafana-agent-k8s:grafana-dashboards-consumer  grafana_dashboard         regular  
jupyter-controller:metrics-endpoint            grafana-agent-k8s:metrics-endpoint             prometheus_scrape         regular  
katib-db-manager:k8s-service-info              katib-controller:k8s-service-info              k8s-service               regular  
katib-db:database                              katib-db-manager:relational-db                 mysql_client              regular  
katib-db:database-peers                        katib-db:database-peers                        mysql_peers               peer     
katib-db:restart                               katib-db:restart                               rolling_op                peer     
katib-db:upgrade                               katib-db:upgrade                               upgrade                   peer     
kfp-api:kfp-api                                kfp-persistence:kfp-api                        k8s-service               regular  
kfp-api:kfp-api                                kfp-ui:kfp-api                                 k8s-service               regular  
kfp-api:metrics-endpoint                       grafana-agent-k8s:metrics-endpoint             prometheus_scrape         regular  
kfp-db:database                                kfp-api:relational-db                          mysql_client              regular  
kfp-db:database-peers                          kfp-db:database-peers                          mysql_peers               peer     
kfp-db:restart                                 kfp-db:restart                                 rolling_op                peer     
kfp-db:upgrade                                 kfp-db:upgrade                                 upgrade                   peer     
kfp-viz:kfp-viz                                kfp-api:kfp-viz                                k8s-service               regular  
knative-serving:local-gateway                  kserve-controller:local-gateway                serving-local-gateway     regular  
kubeflow-dashboard:links                       jupyter-ui:dashboard-links                     kubeflow_dashboard_links  regular  
kubeflow-dashboard:links                       katib-ui:dashboard-links                       kubeflow_dashboard_links  regular  
kubeflow-dashboard:links                       kfp-ui:dashboard-links                         kubeflow_dashboard_links  regular  
kubeflow-dashboard:links                       kubeflow-volumes:dashboard-links               kubeflow_dashboard_links  regular  
kubeflow-dashboard:links                       mlflow-server:dashboard-links                  kubeflow_dashboard_links  regular  
kubeflow-dashboard:links                       tensorboards-web-app:dashboard-links           kubeflow_dashboard_links  regular  
kubeflow-dashboard:links                       training-operator:dashboard-links              kubeflow_dashboard_links  regular  
kubeflow-profiles:kubeflow-profiles            kubeflow-dashboard:kubeflow-profiles           k8s-service               regular  
metacontroller-operator:metrics-endpoint       grafana-agent-k8s:metrics-endpoint             prometheus_scrape         regular  
minio:metrics-endpoint                         grafana-agent-k8s:metrics-endpoint             prometheus_scrape         regular  
minio:object-storage                           argo-controller:object-storage                 object-storage            regular  
minio:object-storage                           kfp-api:object-storage                         object-storage            regular  
minio:object-storage                           kfp-profile-controller:object-storage          object-storage            regular  
minio:object-storage                           kfp-ui:object-storage                          object-storage            regular  
minio:object-storage                           kserve-controller:object-storage               object-storage            regular  
minio:object-storage                           mlflow-server:object-storage                   object-storage            regular  
mlflow-mysql:database                          mlflow-server:relational-db                    mysql_client              regular  
mlflow-mysql:database-peers                    mlflow-mysql:database-peers                    mysql_peers               peer     
mlflow-mysql:restart                           mlflow-mysql:restart                           rolling_op                peer     
mlflow-mysql:upgrade                           mlflow-mysql:upgrade                           upgrade                   peer     
mlflow-server:metrics-endpoint                 grafana-agent-k8s:metrics-endpoint             prometheus_scrape         regular  
mlmd:grpc                                      envoy:grpc                                     k8s-service               regular  
mlmd:grpc                                      kfp-metadata-writer:grpc                       k8s-service               regular  
oidc-gatekeeper:client-secret                  oidc-gatekeeper:client-secret                  client-secret             peer     
oidc-gatekeeper:oidc-client                    dex-auth:oidc-client                           oidc-client               regular  
prometheus:receive-remote-write                grafana-agent-k8s:send-remote-write            prometheus_remote_write   regular  
resource-dispatcher:secrets                    kserve-controller:secrets                      kubernetes_manifest       regular  
resource-dispatcher:service-accounts           kserve-controller:service-accounts             kubernetes_manifest       regular  
seldon-controller-manager:grafana-dashboard    grafana-agent-k8s:grafana-dashboards-consumer  grafana_dashboard         regular  
seldon-controller-manager:metrics-endpoint     grafana-agent-k8s:metrics-endpoint             prometheus_scrape         regular  
training-operator:metrics-endpoint             grafana-agent-k8s:metrics-endpoint             prometheus_scrape         regular  

Note that relations between mlflow-server and resource-dispatcher are missing because I didn't bump mlflow-server to latest/edge.
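
For anyone repeating this check after an upgrade, here is a minimal sketch of how the status above could be verified automatically. It assumes the tabular juju status output has been saved as plain text with the same section headers as shown here (Unit, Integration provider). The helper names parse_sections, unhealthy_units and related are hypothetical and are not part of Juju or of the upgrade guide.

```python
def parse_sections(status_text):
    """Split tabular `juju status` text into sections keyed by the first header word.

    Assumption: sections are separated by blank lines and the first
    non-blank line of each section is its header row.
    """
    sections = {}
    current = None
    for line in status_text.splitlines():
        if not line.strip():
            current = None  # a blank line ends the current section
            continue
        tokens = line.split()
        if current is None:
            current = tokens[0]  # e.g. "Unit" or "Integration"
            sections[current] = []
        else:
            sections[current].append(tokens)
    return sections


def unhealthy_units(sections):
    """Return the units whose workload/agent state is not active/idle."""
    return [row[0] for row in sections.get("Unit", [])
            if row[1:3] != ["active", "idle"]]


def related(sections, app_a, app_b):
    """Return True if any integration row links the two applications."""
    for row in sections.get("Integration", []):
        provider = row[0].split(":")[0]
        requirer = row[1].split(":")[0]
        if {provider, requirer} == {app_a, app_b}:
            return True
    return False


# Small sample in the same layout as the status output above.
SAMPLE = """\
Unit              Workload  Agent  Address      Ports          Message
mlflow-server/0*  active    idle   10.244.0.9
minio/0*          active    idle   10.244.0.47  9000-9001/TCP

Integration provider  Requirer                      Interface       Type     Message
minio:object-storage  mlflow-server:object-storage  object-storage  regular
"""

if __name__ == "__main__":
    sections = parse_sections(SAMPLE)
    print("Unhealthy units:", unhealthy_units(sections))
    print("mlflow-server <-> resource-dispatcher:",
          related(sections, "mlflow-server", "resource-dispatcher"))
```

Run against the full output above, a check like related(sections, "mlflow-server", "resource-dispatcher") would be expected to report the missing relations described in the note.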

orfeas-k commented 1 month ago

The guide is published at https://charmed-kubeflow.io/docs/upgrade-18-19