Open nobuto-m opened 3 months ago
We should probably implement a guard to make sure the Kubernetes resource patch is ready before we try to do anything, like it was done in this PR.
Hey @nobuto-m
I've deployed the same revisions you had:
Model Controller Cloud/Region Version SLA Timestamp
cos microk8s microk8s/localhost 3.5.4 unsupported 17:44:01-03:00
App Version Status Scale Charm Channel Rev Address Exposed Message
alertmanager 0.27.0 active 1 alertmanager-k8s latest/edge 129 10.152.183.108 no
catalogue active 1 catalogue-k8s latest/edge 58 10.152.183.135 no
grafana 9.5.3 active 1 grafana-k8s latest/edge 118 10.152.183.21 no
loki 2.9.6 active 1 loki-k8s latest/edge 163 10.152.183.84 no
prometheus 2.52.0 active 1 prometheus-k8s latest/edge 210 10.152.183.177 no
traefik 2.11.0 active 1 traefik-k8s latest/edge 203 10.152.183.222 no Serving at 192.168.1.240
Unit Workload Agent Address Ports Message
alertmanager/0* active idle 10.1.9.222
catalogue/0* active idle 10.1.9.210
grafana/0* active idle 10.1.9.230
loki/0* active idle 10.1.9.235
prometheus/0* active idle 10.1.9.220
traefik/0* active idle 10.1.9.226 Serving at 192.168.1.240
And I wasn't able to reproduce the issue:
juju show-status-log alertmanager/0 --days 1
Time Type Status Message
18 Sep 2024 17:15:59-03:00 workload waiting installing agent
18 Sep 2024 17:15:59-03:00 juju-unit allocating
18 Sep 2024 17:16:15-03:00 workload waiting agent initialising
18 Sep 2024 17:16:42-03:00 workload maintenance installing charm software
18 Sep 2024 17:16:42-03:00 juju-unit executing running install hook
18 Sep 2024 17:16:45-03:00 juju-unit executing running alerting-relation-created hook
18 Sep 2024 17:16:47-03:00 juju-unit executing running catalogue-relation-created hook
18 Sep 2024 17:16:48-03:00 juju-unit executing running grafana-dashboard-relation-created hook
18 Sep 2024 17:16:50-03:00 juju-unit executing running grafana-source-relation-created hook
18 Sep 2024 17:16:51-03:00 juju-unit executing running ingress-relation-created hook
18 Sep 2024 17:16:52-03:00 juju-unit executing running replicas-relation-created hook
18 Sep 2024 17:16:53-03:00 juju-unit executing running self-metrics-endpoint-relation-created hook
18 Sep 2024 17:16:54-03:00 juju-unit executing running leader-elected hook
18 Sep 2024 17:16:55-03:00 juju-unit executing running alertmanager-pebble-ready hook
18 Sep 2024 17:16:56-03:00 juju-unit executing running data-storage-attached hook
18 Sep 2024 17:17:42-03:00 juju-unit executing running config-changed hook
18 Sep 2024 17:17:53-03:00 juju-unit executing running start hook
18 Sep 2024 17:17:55-03:00 juju-unit executing running alertmanager-pebble-ready hook
18 Sep 2024 17:17:58-03:00 juju-unit executing running ingress-relation-joined hook for traefik/0
18 Sep 2024 17:17:59-03:00 juju-unit executing running alerting-relation-joined hook for prometheus/0
18 Sep 2024 17:18:01-03:00 juju-unit executing running ingress-relation-changed hook for traefik/0
18 Sep 2024 17:18:02-03:00 juju-unit executing running catalogue-relation-joined hook for catalogue/0
18 Sep 2024 17:18:04-03:00 juju-unit executing running grafana-dashboard-relation-joined hook for grafana/0
18 Sep 2024 17:18:04-03:00 juju-unit idle
18 Sep 2024 17:18:05-03:00 juju-unit executing running catalogue-relation-changed hook for catalogue/0
18 Sep 2024 17:18:06-03:00 juju-unit executing running grafana-dashboard-relation-changed hook for grafana/0
18 Sep 2024 17:18:07-03:00 juju-unit idle
18 Sep 2024 17:18:07-03:00 juju-unit executing running self-metrics-endpoint-relation-joined hook for prometheus/0
18 Sep 2024 17:18:08-03:00 juju-unit executing running grafana-source-relation-joined hook for grafana/0
18 Sep 2024 17:18:09-03:00 juju-unit executing running grafana-source-relation-changed hook for grafana/0
18 Sep 2024 17:18:10-03:00 juju-unit executing running alerting-relation-changed hook for prometheus/0
18 Sep 2024 17:18:11-03:00 juju-unit executing running alerting-relation-joined hook for loki/0
18 Sep 2024 17:18:12-03:00 juju-unit executing running self-metrics-endpoint-relation-changed hook for prometheus/0
18 Sep 2024 17:18:13-03:00 juju-unit executing running ingress-relation-changed hook
18 Sep 2024 17:18:16-03:00 workload active
18 Sep 2024 17:18:17-03:00 juju-unit executing running alerting-relation-changed hook for loki/0
18 Sep 2024 17:18:18-03:00 juju-unit idle
I believe that it's a timing issue... How much RAM and CPU does the machine have? 🤔
Please, may you try deploying now from edge
and tell me if you see the same issue?
Bug Description
The charm status gets into error unnecessarily in the middle of deployment.
To Reproduce
Environment
Relevant log output
Additional context
cos-lite_debug.log