Closed (brandond closed 2 months ago)
Sorry, I posted the k3s results here and closed the issue by mistake, hence re-opening (I have deleted the k3s results). I will update with the rke2 results and close next week.
Infrastructure
Node(s) CPU architecture, OS, and Version:
$ cat /etc/os-release
PRETTY_NAME="Ubuntu 22.04.2 LTS"
$ uname -m
x86_64
Cluster Configuration:
HA: 3 servers / 1 agent
Config.yaml:
token: xxxx
write-kubeconfig-mode: "0644"
node-external-ip: 1.1.1.1
$ sudo mkdir -p /etc/rancher/rke2 && sudo cp config.yaml /etc/rancher/rke2
$ curl -sfL https://get.rke2.io | sudo INSTALL_RKE2_VERSION='v1.27.16-rc4+rke2r1' INSTALL_RKE2_TYPE='server' INSTALL_RKE2_METHOD=tar sh -
$ sudo systemctl enable --now rke2-server
or
$ sudo systemctl enable --now rke2-agent
$ kubectl get nodes -o wide
$ kubectl get pods -A
$ netstat -na | grep 6443
Disconnect the network on the server that shows up in the netstat output: ip link set dev eth0 down (or whatever interface that node is using)
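To decide which server to disconnect, the netstat check can be narrowed to the remote peers the agent currently holds open on port 6443. This is a minimal sketch: `connected_servers` is a hypothetical helper name, not part of rke2, and it assumes the usual Linux `netstat -tn` column layout (foreign address in column 5, state in column 6).

```shell
# Hypothetical helper (not part of rke2): reads `netstat -tn` style output on
# stdin and prints each distinct remote address with an ESTABLISHED
# connection to port 6443.
connected_servers() {
  awk '$6 == "ESTABLISHED" && $5 ~ /:6443$/ {
         sub(/:6443$/, "", $5)   # drop the port, keep the address
         print $5
       }' | sort -u
}

# Usage on the agent node:
#   netstat -tn | connected_servers
```

The address printed is the server whose network interface would then be taken down in the step above.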
The failed server should get removed from the server list.
Replication Results:
$ rke2 -v
rke2 version v1.27.15+rke2r1 (4014ef7e8fbfb47e6a1e2261a73ca50aef8ba7fb)
go version go1.21.11 X:boringcrypto
level=error msg="Remotedialer proxy error; reconnecting..." error="websocket: close 1006 (abnormal closure): unexpected EOF" url="wss://<ip1>:9345/v1-rke2/connect"
level=info msg="Closing 1 connections to load balancer server <ip1>:6443"
level=info msg="Connecting to proxy" url="wss://<ip1>:9345/v1-rke2/connect"
level=error msg="Failed to connect to proxy. Empty dialer response" error="dial tcp <ip1>:9345: connect: connection refused"
.
.
level=info msg="Removing server from load balancer rke2-api-server-agent-load-balancer: <ip1>:6443"
level=info msg="Updated load balancer rke2-api-server-agent-load-balancer server addresses -> [<ip2>:6443 <ip3>:6443] [default: <ip1>:6443]"
level=info msg="Removing server from load balancer rke2-agent-load-balancer: <ip1>:9345"
level=info msg="Updated load balancer rke2-agent-load-balancer server addresses -> [<ip2>:9345 <ip3>:9345] [default: <ip1>:9345]"
Validation Results:
$ rke2 -v
rke2 version v1.27.16-rc4+rke2r1 (4f40ff06079f648f3194b344c4199997a970614d)
go version go1.22.5 X:boringcrypto
level=error msg="Remotedialer proxy error; reconnecting..." error="websocket: close 1006 (abnormal closure): unexpected EOF" url="wss://<ip1>:9345/v1-rke2/connect"
level=info msg="Closing 3 connections to load balancer server <ip1>:6443"
level=debug msg="Failed over to new server for load balancer rke2-api-server-agent-load-balancer: <ip1>:6443 -> <ip3>:6443"
level=info msg="Connecting to proxy" url="wss://<ip1>:9345/v1-rke2/connect"
level=error msg="Failed to connect to proxy. Empty dialer response" error="dial tcp <ip1>:9345: connect: connection refused"
level=error msg="Remotedialer proxy error; reconnecting..." error="dial tcp <ip1>:9345: connect: connection refused" url="wss://<ip1>:9345/v1-rke2/connect"
.
.
level=info msg="Removing server from load balancer rke2-api-server-agent-load-balancer: <ip1>:6443"
level=info msg="Updated load balancer rke2-api-server-agent-load-balancer server addresses -> [<ip2>:6443 <ip3>:6443] [default: <ip1>:6443]"
level=info msg="Removing server from load balancer rke2-agent-load-balancer: <ip1>:9345"
level=info msg="Updated load balancer rke2-agent-load-balancer server addresses -> [<ip2>:9345 <ip3>:9345] [default: <ip1>:9345]"
level=info msg="Stopped tunnel to <ip1>:9345"
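The check performed above amounts to looking for the "Removing server from load balancer" and "Updated load balancer" messages in the agent log after the server goes down. A rough sketch of that filter follows; `lb_removal_events` is a hypothetical helper name, and reading the log through `journalctl -u rke2-agent` assumes the agent runs as the systemd unit enabled earlier.

```shell
# Hypothetical helper (not part of rke2): reads agent log lines on stdin and
# keeps only the load balancer removal and update messages quoted above.
lb_removal_events() {
  grep -E 'msg="(Removing server from load balancer|Updated load balancer) '
}

# Usage on the agent node:
#   journalctl -u rke2-agent --no-pager | lb_removal_events
```

If the filter prints nothing well after the server has been disconnected, the server was not removed from the load balancer list.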
Backport fix for Agent loadbalancer may deadlock when servers are removed (6208)