Open OneilYang opened 9 months ago
Is there any overtime check in the lotus worker status?
I supposed the overtime check is necessary, it should save application for many reasons including this one.
Hi, the stuck "worker" actually is the one thread of lotus daemon, not the lotus-worker process, I suppose the label should be "area/lotus..." or something about lotus. tks
and seems I can reproduce it with long time running on many machines, if you want to debug it. please let me know.
I found lots of user have this issue, and seems they dont know how it happened: one of fvm stuck --> now sync still works until splistore start --> splitsotre will check sync status, so lotus is over they just feel the splitstore is not stable enough, actually it's the problem about sync issue;
how to resolve it:
Checklist
Latest release
, the most recent RC(release canadiate) for the upcoming release or the dev branch(master), or have an issue updating to any of these.Lotus component
Lotus Version
Repro Steps
For now, "lotus sync wait" only displays this worker's output, but chain sync seems ok('lotus-miner info' displays sync ok);
if use splitstore, it will check "sync wait" in the compaction, and then lotus will stuck entirely.
Describe the Bug
For now, "lotus sync wait" only displays this worker's output, but chain sync seems ok('lotus-miner info' displays sync ok);
if use splitstore, it will check "sync wait" in the compaction, and then lotus will stuck entirely.
Logging Information