Open pierrebeaucamp opened 5 years ago
This issue seems to be more severe than I initially thought. Twice now in the last 36 hours, our entire cluster required recovery (through re-bootstrapping the masters). Kubectl suddenly stopped working (error: You must be logged in to the server
, similar to https://github.com/coreos/tectonic-forum/issues/161) and ingress stopped working. I see myself forced to migrate to another Kubernetes solution if this is not being addressed soon
Got a similar issue on a metal setup that was updated to 1.9.6-tectonic.2. I type in user and pass, get a glimpse of the dashboard and am immediately redirected to the login prompt again.
im also facing issue with 1.9.6-tectonic.2 fresh install, masters won't go online, behind elb something is differently wrong.
What keywords did you search in tectonic-installer issues before filing this one?
tectonic-identity. I also looked through recently opened and closed bugs.
Is this a BUG REPORT or FEATURE REQUEST?
BUG REPORT
After our cluster updated itself to 1.9.6-tectonic.2, we're getting a lot of alerts about tectonic-identity pods frequently restarting.
(This pattern repeats itself since upgrading to the 1.9.6-tectonic.2 release)
Versions
terraform version
): Terraform v0.11.8What happened?
The cluster, having auto-update enabled, updated itself to 1.9.6-tectonic.2. Afterwards the tectonic-identity pods started failing. When they're up and running, everything works fine and we can access and interact with the tectonic console. When they're down, we're getting 503 errors when accessing the console.
What you expected to happen?
The tectonic-identity pods should be stable
How to reproduce it (as minimally and precisely as possible)?
Update to 1.9.6-tectonic.2
Anything else we need to know?
All other components of our cluster operate normally