p2p discovery: public P2P nodes removed from boot nodes after being restarted

System information

Geth version: latest CL client & version: latest OS & Version: Linux

Expected behaviour

Expected: whenever restarting p2p node, it can rejoin the boot node easily.

Actual behaviour

Actual: When it comes to NLB, It's really very difficult for p2p to rejoin in boot node.

Steps to reproduce the behaviour

I built a small private network of eth for testing. One node as boot node hidden behind a network load balance service (NLB), Two as p2p nodes , also behind a NLB. The network looked like this: opbnb P2P 网络不通问题 (1)

When I finished the network, it works fine. But When I restart the p2p nodes, they were removed by boot node and never came back!

Backtrace

Here is the debug logs from p2p node when restarting: "started discovery service"

Here is that from boot node when removing the p2p node: "dead node"

And here are the configurations of p2p node and boot node, p2p node:

     --p2p.sync.req-resp 
     --p2p.listen.ip=0.0.0.0
     --p2p.listen.tcp=9003
     --p2p.listen.udp=9003
     --p2p.priv.raw={priv key}
     --p2p.advertise.ip={public ip}

boot node:

    --p2p.listen.ip=0.0.0.0
    --p2p.listen.tcp=9003
    --p2p.listen.udp=9003
    --p2p.priv.raw={priv key}
    --p2p.advertise.ip={public ip}

We can see that the p2p node was restarted success at "09:18:01" and soon removed after 3 seconds at "09:18:04". The log "Removed dead node" means that two things had been done successfully within 3 seconds:

boot node handshake with p2p node.
boot node add p2p node into its table.
boot node send a PING to p2p node.

Finally i find the root cause: the first PING packet sent by boot node was transfer to the previous pod ip of p2p nodes, in the NLB of the p2p node side.Illustrated below:

Since it need sometime to refresh the router info for NLB(usually more than 3 seconds), the P2P node was very difficult to received the first PING from boot node. I am wondering that: Shall we PING more times rather than only one time before we confirm that the peer is not alive? Like After 6 times PING failed then we say it's dead ?

I can understand that a high quality of network condition can be ensure by "PING only one time and require PONG within 700 millseconds", but it's really a little tricky that they can't be be placed behind NLB....

ethereum / go-ethereum