Open PenelopeFudd opened 1 year ago
Thanks for raising this detailed report @PenelopeFudd. Reviewing your notes I agree there are some areas we could expand the recovery documentation, namely ensure people are aware of the directory creation and ownership requirements.
I also like the suggestion on the more meaningful error message for snapshot restore permission issues, that would need to be completed in the main etcd repo. However I do note you're using etcd 3.2.26
which is quite old, we would need to verify if that error has already been improved in later releases.
Thanks for reporting @PenelopeFudd and +1 @jmhbnz - v3.2 is not supported so it would be great if you can try v3.5 (the doc you are using) or main branch.
I have a three-node etcd cluster (used with Patroni), and one node decided to break.
Read through https://etcd.io/docs/v3.5/op-guide/recovery/ and wasn't able to get the node to work.
Errors:
Did a bunch of googling, found lots related to Kubernetes, found a few for standalone clusters and/or older versions.
Finally figured out my problems (etcd-related, anyway):
Notes:
ETCDCTL_API=3
environment variable shouldn't be necessary, it looks antiquated--data-dir
value was gleaned from looking at a good nodejournalctl
was:etcd[241163]: error listing data dir: /var/lib/etcd/default
etcd[241163]: cannot access '/var/lib/etcd/default': Permission denied
it'd be appreciated. :-)/etc/default/etcd
file contained a number of environment variables that etcd usesEnvironment: Ubuntu 22.04, etcd 3.2.26
Thanks for a great program!