Closed yorickdowne closed 3 years ago
I can't get the node to work anymore, regardless of docker-compose down
, removing data, etc. It's completely stuck with setup like this: https://github.com/stereum-dev/ethereum2-docker-compose/blob/prater/compose-examples/lighthouse-only/override-examples/docker-compose.ssv-no-geth.override.yaml#L20 (no host-network, just a plain docker network with ports exposed)
Unrelated to your issue: You may want to make sure 12,000 is udp not tcp in your compose file.
I think the only reason this works for me is that I am in docker swarm and there's a slight delay when querying the port through ingress, which allows the node to "come up" enough to respond. At least that's my theory. I can't see why it would fail intermittently with an overlay network and consistently with a host network, otherwise.
I think your setup is a host network: Plain docker doesn't have overlay networks and ingress routing.
Resolved with 0.1.2
Tested on
0.0.9
, the node can fail in the following way during startup. This happens rarely when running with an ingress network, and always when running withhost
mapping. More on that further down.The setup looks like this, and with the
host
commented out and going through the ingress load balancer, this failure is rare. When going forhost
, it happens every time.Which begs the question whether this is really an SSV bug. I believe it is, because:
host
mode, just more rarelyhost
mode with Chainlink nodes without any issuesRationale / why is
host
mode desirable:host
mode speeds up connections, as it's direct to host instead of going through LB. It's a networking optimization. As well, failures like this should not happen, regardless of latency of networking - and right now they do happen in both modes, just rarely withouthost
.