Closed peterlimg closed 2 years ago
Shutting down miners all at once is the problem perhaps, when they were force closed, the state of last finalized block may not be persisted to db yet . While sharders do have persisted those state changes, and even one round ahead. So after all miners were restarted, they will fetch the latest finalized block from sharders, and try to start the block from it, but new blocks could not be verified due compute state failed
error described above.
We can fix this by adding an endpoint to sharder for getting state changes. Currently, we can only fetch state changes from miners.
After restarting a cluster to update docker image, the chain got stuck. Because miners are all failing to compute state to verify received blocks. It failed to get the state changes of previous blocks.