stackabletech / t2

A platform that helps with integration tests and troubleshooting/support
Apache License 2.0

Increase K3s cluster stability #371

Open backstreetkiwi opened 1 year ago

backstreetkiwi commented 1 year ago

During our research for https://github.com/stackabletech/t2/issues/368, we experimented with 2 long-running K3s clusters. Unfortunately, they did not run for long and crashed fairly soon.

Symptoms:

The post-mortem analysis was difficult because the journals were gone after the reboot (see https://github.com/stackabletech/infrastructure/issues/59).
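One common reason for journals disappearing on reboot is that systemd-journald keeps them only in volatile storage. Whether that applies here is an assumption, since the thread does not confirm the cause. Below is a minimal sketch of how one might check this on a node: it reads journald.conf-style text and reports whether logs would survive a reboot. The function names are hypothetical, and drop-in files under `journald.conf.d` are ignored for brevity.

```python
def journal_storage_mode(conf_text: str) -> str:
    """Return the Storage= value from journald.conf-style text.

    Falls back to "auto", which is journald's documented default.
    """
    mode = "auto"
    section = None
    for raw in conf_text.splitlines():
        line = raw.strip()
        # Skip blank lines and comments.
        if not line or line.startswith(("#", ";")):
            continue
        # Track the current [Section] header.
        if line.startswith("[") and line.endswith("]"):
            section = line[1:-1]
            continue
        if section == "Journal" and "=" in line:
            key, value = line.split("=", 1)
            if key.strip() == "Storage":
                mode = value.strip().lower()
    return mode


def survives_reboot(mode: str, var_log_journal_exists: bool) -> bool:
    """Tell whether journal entries are kept across reboots."""
    if mode == "persistent":
        return True
    if mode == "auto":
        # With "auto", journald persists logs only if /var/log/journal exists.
        return var_log_journal_exists
    # "volatile" keeps logs in /run only; "none" stores nothing.
    return False


if __name__ == "__main__":
    import os

    # Path assumed to be the main config file; adjust for your distribution.
    path = "/etc/systemd/journald.conf"
    text = open(path).read() if os.path.exists(path) else ""
    mode = journal_storage_mode(text)
    print(mode, survives_reboot(mode, os.path.isdir("/var/log/journal")))
```

If the check reports that logs are not persistent, setting `Storage=persistent` in the `[Journal]` section and restarting systemd-journald should keep journals available for the next post-mortem.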

In this task, we should:

backstreetkiwi commented 1 week ago

We switched to managed K8s completely, perhaps partly because of problems like this. I guess we could close this issue as well, @Jimvin?