Open motjuste opened 4 months ago
Possibly related to traefik deleting the local app databag when validation of the remote data fails. Traefik listens to created, joined, changed (legacy). Probably should only listen to changed instead.
Details and crashdump artifacts: see https://oil-jenkins.canonical.com/artifacts/f2fe5cfc-ae75-4e82-84ee-0ce88dc484d4/index.html
Bug Description
We at SolutionsQA have noticed a failing deployment of COS where the unit
traefik
stays stuck executing for 4 hours. The debug logs in the relevant crashdump show only the following errors for the unit but no apparent exception details (the full file is attached).To Reproduce
We are using FCE to deploy microk8s on top of MAAS using Juju, and then deploying COS (relevant SKU being
fkb-master-kubernetes-focal-baremetal-kubeflow
). See below for identified recurrence of this issue in other scenarios.Environment
More details available for an example run can be found here. See below for other identified recurrences of this issue at SolutionsQA.
Relevant log output
Additional context
More identified occurrences of this issue can be found here.