siderolabs / talos

Talos Linux is a modern Linux distribution built for Kubernetes.
https://www.talos.dev
Mozilla Public License 2.0
6.56k stars 523 forks source link

Talosctl reset hanging after checks are completed #8802

Open ChickenIQ opened 4 months ago

ChickenIQ commented 4 months ago

Bug Report

Description

Talosctl may hang after completing the necessary checks after resetting the nodes. This is inconsistent, but I managed to replicate it a handful of times during my testing. Manually canceling results in the desired outcome.

Command used: talosctl reset --reboot --graceful=false --system-labels-to-wipe EPHEMERAL

Logs

◲ watching nodes: [192.168.1.100 192.168.1.99]

smira commented 4 months ago

What are the endpoinds (talosctl config info)?

ChickenIQ commented 4 months ago

What are the endpoinds (talosctl config info)?

I'm currently unable to reach my computer, but they are 192.168.1.100 and 192.168.1.99, that are both are controlplanes.

Could this be because this is not a highly available configuration?

smira commented 4 months ago

You can't reset both controlplanes, this doesn't make any sense in general, as etcd data will be lost.

I think the issue here is probably still valid.

ChickenIQ commented 4 months ago

You can't reset both controlplanes, this doesn't make any sense in general, as etcd data will be lost.

I think the issue here is probably still valid.

My goal is to fully reset the nodes without them going into maintenance mode, so I can keep my secrets valid, then I regenerate them, use the old config to apply them then bootstrap the cluster again, resulting in a clean slate, all automated.

Data loss is not a problem, it is the desired outcome.