Closed dparv closed 1 month ago
@dparv , in the STR you are not using --trust
.
Can you confirm if you're trusting the app?
Yes, all apps are deployed using trust, this fails on random occasions, it's not always failing.
Yes, all apps are deployed using trust, this fails on random occasions, it's not always failing.
Alright. I'm trying on AKS then
Yes, all apps are deployed using trust, this fails on random occasions, it's not always failing.
Alright. I'm trying on AKS then
btw, did you ever saw that other than on AKS?
only testing kubeflow deployment extensively on AKS, so that's where I've hit it, don't know if it's happening on other k8s clusters
The root cause is that the *-pebble-ready
handler on mysql_provider
fails on a httpx error - connection refused to k8s api, failing the event, which refrains juju from committing databag keys from other(s) handler(s) observing the same event.
This will render a state mismatch between the charm and the workload, preventing any further self maintenance to take place.
The root cause is that the
*-pebble-ready
handler onmysql_provider
fails on a httpx error - connection refused to k8s api, failing the event, which refrains juju from committing databag keys from other(s) handler(s) observing the same event. This will render a state mismatch between the charm and the workload, preventing any further self maintenance to take place.
The funny part, I didn't notice this issue on preparing https://charmhub.io/mysql-k8s/docs/h-deploy-aks Strange...
The root cause is that the
*-pebble-ready
handler onmysql_provider
fails on a httpx error - connection refused to k8s api, failing the event, which refrains juju from committing databag keys from other(s) handler(s) observing the same event. This will render a state mismatch between the charm and the workload, preventing any further self maintenance to take place.The funny part, I didn't notice this issue on preparing https://charmhub.io/mysql-k8s/docs/h-deploy-aks Strange...
It's not something that shows up very often. @dparv was kind enough to give me access to an environment where it happened. curling the the k8s endpoint and manually triggering the hook after the error did not render the error, hence the conclusion of some transient issue.
Steps to reproduce
Expected behavior
charm in active/idle state
Actual behavior
charm in unknown idle/state
Versions
K8s: AKS 1.28.10
Juju CLI: 3.4.4
Juju agent: 3.4.4
Charm revision: 153
Log output
Juju debug log:
Kubectl debug log:
Full traces: https://pastebin.canonical.com/p/R9Bf347V6S/
Additional context