Closed jagerman closed 1 year ago
On further investigation it appears that this node has an outdated bootstrap file. So the connectivity issue makes sense, but there really needs to be a serious warning when the nodedb is close to empty and we are active.
Lokinet appears to track the decomm/recomm states fine, but isn't reachable by anyone else on the network until it gets restarted.
Yes, I narrowly escaped a deregistration once, due to this bug. I also know a couple of other ops who weren't so lucky.
Now, when I bring up a shared node and I don't know when exactly it will finally go live on the network, I insulate myself from this bug with the following job in /etc/crontab
.
*/15 * * * * root systemctl restart lokinet-router
TODO:
did this get fixed?
A couple of times now I've had a service node get decommissioned due to Lokinet unreachability after an unlock-and-reregister. Connectivitity seems fine after a restart of lokinet.
Timeline:
Up to the re-registration we have, every 30 seconds, the expected whining:
which is fine, we aren't a registered service node. These stop after the re-registration, there are no log statements at all for the next 5.5 hours, then we got deregistered because of failing lokinet connectivity. Other nodes at this time were reporting that Lokinet is unreachable; there are no log statements (logging is at warning level) for 5.5 hours, then a decomm, a recomm, and then a dereg.
Lokinet appears to track the decomm/recomm states fine, but isn't reachable by anyone else on the network until it gets restarted.