pravega / zookeeper-operator

Kubernetes Operator for Zookeeper
Apache License 2.0
364 stars 203 forks source link

Data loss when restarting the lead node #577

Open walleliu1016 opened 9 months ago

walleliu1016 commented 9 months ago

Description

I followed the demo in the readme and created a 3-node cluster. I saw that the node count was 7 through the srvr command. Then I updated the image and restarted the pods in sequence. When it was updated to pod-0, pod-0 was the leader at this time and the update was completed. Afterwards, the node count of pod-0 was 5, and the data in /zookeeper-operator/zookeeper was lost.

image

apiVersion: "zookeeper.pravega.io/v1beta1" kind: "ZookeeperCluster" metadata: name: "zookeeper" spec: replicas: 3 storageType: ephemeral triggerRollingRestart: true

Importance

blocker

Location

(Where is the piece of code, package, or document affected by this issue?)

Suggestions for an improvement

(How do you suggest to fix or proceed with this issue?)