Open todeb opened 1 month ago
I don't think this will work automatically. Can you provide some points @datacore-tilangovan ?
did also a test scalling to 2 replicas, in that case etcd-0 was healthy but etcd-1 show errors:
k logs -n openebs openebs-etcd-1
Defaulted container "etcd" out of: etcd, volume-permissions (init)
etcd 10:38:13.72
etcd 10:38:13.72 Welcome to the Bitnami etcd container
etcd 10:38:13.72 Subscribe to project updates by watching https://github.com/bitnami/containers
etcd 10:38:13.72 Submit issues and feature requests at https://github.com/bitnami/containers/issues
etcd 10:38:13.72
etcd 10:38:13.72 INFO ==> ** Starting etcd setup **
etcd 10:38:13.76 INFO ==> Validating settings in ETCD_* env vars..
etcd 10:38:13.77 WARN ==> You set the environment variable ALLOW_NONE_AUTHENTICATION=yes. For safety reasons, do not use this flag in a production environment.
etcd 10:38:13.78 INFO ==> Initializing etcd
etcd 10:38:13.78 INFO ==> Generating etcd config file using env variables
etcd 10:38:13.81 INFO ==> Detected data from previous deployments
etcd 10:38:14.00 INFO ==> Updating member in existing cluster
{"level":"warn","ts":"2024-05-16T10:38:14.099Z","logger":"etcd-client","caller":"v3@v3.5.6/retry_interceptor.go:62","msg":"retrying of unary invoker failed","target":"etcd-endpoints://0xc0000e6a80/openebs-etcd-0.openebs-etcd-headless.openebs.svc.cluster.local:2379","attempt":0,"error":"rpc error: code = NotFound desc = etcdserver: member not found"}
Error: etcdserver: member not found
Also after 1st case when I rollback from 3 to 1 replica for etcd. etcd-0 lost its all data. Fortunately I made a snapshot of etcd before and restored it.
In 2nd case when scalling from 1 to 2 and rollback from 2 to 1. etcd-0 was healthy entire time so data was kept.
I haven't had the time yet, but I wonder if the etcd-operator would help here? https://github.com/openebs/mayastor/issues/1630
Describe the bug After upgrading chart with mayastor.etcd.replicaCount=3 from 1.
etcd-0 member crashloopbackoff with log:
other new deployed etcd members logs:
Current helm values:
Previous helm values:
To Reproduce
Expected behavior all etcd members are healthy
Screenshots If applicable, add screenshots to help explain your problem.
OS info (please complete the following information): openebs.io/version: 2.6.1
Additional context Add any other context about the problem here.