Open scottyeager opened 1 month ago
The node is apparently submitting an invalid IP address to TF Chain (the error is triggered by a simple check that the address is longer than seven characters)
@scottyeager is it possible for you to activate debug log? if possible, we could get more info by looking at this log
"node data have changing, issuing an update node:
https://github.com/threefoldtech/zos/blob/v3.11.3/pkg/registrar/register.go#L210
Probably the IP address itself is empty.
Now i'm looking for the possibilities of this empty IP address.
Probably the IP address itself is empty.
if possible, give the result of this command
ip addr
Hi @iwanbk, this is a node running on mainnet, so there's no chance of debug logging or running any commands via SSH. We might be able to ask the farmer to try booting the node to devnet so some dev can get SSH access. That is assuming the node displays the same behavior.
This appears to have resolved itself for the case at hand, but I still think it's worth looking into how we can prevent it from happening again.
but I still think it's worth looking into how we can prevent it from happening again.
It would be tricky to fix/prevent something when we don't know the root cause yet. But let me check again.
From what i can see, the error could be caused by empty/invalid IP address of the bridge.
Some improvements we probably could do is improving this part https://github.com/threefoldtech/zos/blob/9998be1c6c66c387c106bdee7232cfe7f768ccf4/pkg/registrar/register.go#L113-L116
change the hardcoded zos
to a constant. Other than preventing typo, a constant with good name could give more meaning to the code
Check that the returned zosIps
, make sure that the length of the string representation is >= 7.
In this way, we duplicating the checking in both client(zos) and server (tfchain), but i think it is OK because:
tfchain
Doing IP checking will not solve the real issue, but at least we could have better visibility and avoid wasting TFT
A farmer reported that their node 2358 on mainnet no longer appeared online in the dashboard. I pulled the node logs and saw ongoing errors that the node failed to register with "extrinsic temporarily banned". Since that's caused by insufficient wallet balance for the node, I checked and saw that the node had only 0.0018 TFT and was therefore likely unable to complete transactions.
Then I funded the node wallet with 0.1 TFT and asked the farmer to boot the node up again. This time I was able to observe what had caused the node to drain its wallet:
Just scanning the node logs, I don't see any obvious cause for this error. Maybe there's some clue in here:
We have two problems here: