Closed v4w5gse65g874 closed 1 year ago
I have created a workaround via a script. I'm going to close this task out; if someone else has this issue I'll leave the script here. Note that I realize there's a service script in the CLN repo; but my installation is unique and I didn't want to hack the service script up.
while true
do
if ! pgrep -x "lightningd" > /dev/null
then
/usr/local/bin/lightningd
fi
done
Lightning version:
v23.05.2
Plugins commit id:ce078bb
Issue: I was having issues with bufferbloat on my network and enabled SQM via OpenWRT. I have noticed since then that after a few days my lightning node would crash because the backup plugin seems to ... potentially ... timeout. I have the sql file connected via an sshfs mount to a digital ocean droplet. I suspect that during some bloat the write attempt with the backup plugin needs to "wait" and it must be sensitive to needing to wait and timesout immediately, causing the backup plugin to fatal causing the node to crash. I must state that, sadly, I have updated my lightning node and plugins repo at the same time I enabled SQM, so I'm not 100% confident this issue is due to SQM.
Logs: