Closed thoro closed 4 months ago
Turns out I set the podSubnets network by mistake to include the node network!
cluster:
network:
podSubnets:
- 10.12.16.0/19
Correct would have been 10.12.32.0/19
I would suggest to add a log line here: https://github.com/siderolabs/talos/blob/cc345c8c9413692148360684390c910de9e94748/internal/app/machined/pkg/controllers/etcd/spec.go#L137
so that it's easier to be found.
Talos 1.8 already has diagnostics which help in this particular case. So thanks for reporting, and 1.8 will make issues like that more obious
Bug Report
Description
I'm unable to bootstrap my etcd cluster.
talosctl --talosconfig=./talosconfig -e at-cl02-h03 -n at-cl02-h03 bootstrap
Exits successfully, but on the corresponding node the only output is the following:
Service itself also is "Waiting for spec"
talosctl get etcdconfigs
talosctl get etcdspecs
It seems that the controller never created the etcdspecs, possibly because of the IPs?
Based on code, and no errors, I assume it runs into this continue here: https://github.com/siderolabs/talos/blob/cc345c8c9413692148360684390c910de9e94748/internal/app/machined/pkg/controllers/etcd/spec.go#L203
Relevant parts of the controlplane.yaml:
Tried once like this:
And once like this:
And the node addresses:
talosctl get nodeaddresses
Logs
support.zip
Environment
talosctl version --nodes <problematic nodes>
] 1.7.5kubectl version --short
] 1.30 - but not bootstrapped