Open bashoshaun opened 7 years ago
The customer would like to add the following:
Perhaps it could also mention that while no process has the LOCK files open after riak stop, they do persist in the file system, and also that the file system is xfs on thinly provisioned lvm logical volume, which is used to create a snapshot during the node stop.
A customer occasionally runs into a problem where their node will not start back up after it's been shutdown to perform a backup.
On startup, they see this kind of output in console.log:
I've attached the full console.log file for context.
The customer has also implemented some basic debugging in their startup script as it was initially thought a process may have been holding a lock on these console.log.txt
files. This is an example of what they looked like when the node was up and running normally:
lsof
:After the stop, there are no leveldb entries in the output of
lsof
. Similarly, there are no beam processes running.So when Riak restarts, there's no obvious reason why there should be a failure. When the customer then manually attempts a restart after realising the node is down, it comes up fine.
Customer is running RHEL7.