Closed heysarver closed 2 months ago
@heysarver Please try rebooting the node with hcloud, see if it fixes it.
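For anyone following along, a minimal sketch of the reboot suggested above, assuming the `hcloud` CLI is installed and already authenticated for the project. The `run` helper, the `DRY_RUN` variable and the server name in the comments are hypothetical and only there for illustration; by default the helper prints the command instead of executing it.

```shell
# Print the command by default; execute it only when DRY_RUN=0.
run() {
  if [ "${DRY_RUN:-1}" = "0" ]; then
    "$@"
  else
    printf '%s\n' "$*"
  fi
}

# Reboot a stuck node, identified by its Hetzner server name.
reboot_node() {
  run hcloud server reboot "$1"
}

# Example with a hypothetical server name:
#   reboot_node my-control-plane-node              # only prints the command
#   DRY_RUN=0 reboot_node my-control-plane-node    # actually reboots the server
```

Printing first makes it easy to check that the right server name is targeted before anything is rebooted.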
I experience the same problem!
Two things I noticed: the affected nodes don't have a private IP, and neither of them is connected to the private network.
Rebooting solved it, but it's still an issue. I added another worker pool and had the same result: all but one node came up OK, and rebooting that one fixed it again.
@heysarver Remove the kured-ttl setting. Also remove the autoscaler version (the default value is needed).
terraform init -upgrade
Plan B
Make sure the underlying image is good; rebuild it with the packer command if needed.
Debug cloud-init and what could be happening on boot; ask https://claude.ai for the exact commands and give it the logs.
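As a starting point for the cloud-init debugging mentioned above, here is a hedged sketch of read-only diagnostics to collect from the stuck node. It assumes SSH access as root and that cloud-init writes to its usual default log location; the function name `cloud_init_debug_cmds` and the `NODE_IP` placeholder are hypothetical. The function only prints the commands, so nothing runs until the output is passed to `ssh`.

```shell
# Emit a list of read-only diagnostic commands to run on the stuck node.
cloud_init_debug_cmds() {
  cat <<'EOF'
cloud-init status --long
cloud-init analyze blame
journalctl -b -u cloud-init --no-pager | tail -n 100
tail -n 100 /var/log/cloud-init-output.log
ip -brief addr
EOF
}

# Usage (NODE_IP is a placeholder for the node's public IP):
#   ssh "root@$NODE_IP" "$(cloud_init_debug_cmds)" > cloud-init-debug.log 2>&1
```

The last command lists the interface addresses, which helps confirm whether the node actually received a private IP; the resulting log file is what you would share when asking for help.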
@JWDobken please create a new issue with all the details.
Rebuilding the image seems to have solved my issue, thank you.
@mysticaltech I've started using it already and have hit my limits on a new account so I'll have to wait to try, but sounds reasonable.
I can confirm this was my issue with kured_options lock-ttl set to 30m.
When I made a new cluster to confirm, I also had to manually open the firewall ports for the nginx ingress load balancer with this config. Any ideas on that or should I open a new issue?
@heysarver Please restate the issue; I don't clearly understand which problems you are still facing.
@mysticaltech I'm having to manually add firewall rules for the destination nginx-ingress ports after creating the cluster; otherwise all the targets are unhealthy. This causes the Terraform state to get out of sync.
@heysarver Please open a new issue with the full working kube.tf (minus private info) and steps to reproduce.
Description
I'm trying to deploy a cluster with 3 or 5 control nodes; both give the same result. N-1 nodes come up successfully, but after several terraform destroy and apply runs there is always one control node stuck at "Waiting for MicroOS to become available..." until Terraform times out.
Kube.tf file
Screenshots
Failed Node:
Platform
MacOS, Terraform Cloud