Panic randomly occurs on node shutdown, leading to unclean shutdown

Consensys / quorum

A permissioned implementation of Ethereum supporting data privacy

GNU Lesser General Public License v3.0

4.69k stars 1.29k forks source link

Expected behaviour

Panic should not happen on normal node shutdown.

Actual behaviour

panic: sync: WaitGroup is reused before previous Wait has returned randomly happens on node shutdown, leading to unclean shutdown and data loss on the node. I stop one of the non-validator nodes once in a day to safely take a disk snapshot. I have observed this panic message once in a month or two.

Steps to reproduce the behaviour

Launch a QBFT cluster and schedule a normal shutdown once in a day. Sometimes panic: sync: WaitGroup is reused before previous Wait has returned message appears on node shutdown, causing data loss on the node.

Consensys / quorum