projectcalico / calico

Cloud native networking and network security
https://docs.tigera.io/calico/latest/about/
Apache License 2.0

Calico reporting errors during the deployment of the K8S binary cluster #9516

Open blakechang001 opened 3 days ago

blakechang001 commented 3 days ago

Cluster information:
Kubernetes version: 1.21.10
Installation method:
Host OS: CentOS 7
Calico version: v3.28.2

Process:

1. Installed through the tigera-operator.yaml and custom-resources.yaml files.
2. The installation succeeded and the pods are running:

NAME READY STATUS RESTARTS AGE
calico-kube-controllers-6696b5fc97-hlb84 1/1 Running 0 2m33s
calico-node-28flc 1/1 Running 0 2m34s
calico-node-p9tcg 1/1 Running 0 2m34s
calico-typha-9f54f8447-sgpnl 1/1 Running 0 2m34s
csi-node-driver-67h28 2/2 Running 0 2m33s
csi-node-driver-wgwvs 2/2 Running 0 2m33s

3. However, the events of both calico-node pods (calico-node-28flc and calico-node-p9tcg) show the same errors:

Events:
Normal SandboxChanged 3m20s kubelet Pod sandbox changed, it will be killed and re-created.
Normal Pulled 3m19s kubelet Container image "docker.io/calico/pod2daemon-flexvol:v3.17.6" already present on machine
Normal Created 3m19s kubelet Created container flexvol-driver
Normal Started 3m19s kubelet Started container flexvol-driver
Normal Pulled 3m17s kubelet Container image "docker.io/calico/cni:v3.17.6" already present on machine
Normal Created 3m17s kubelet Created container install-cni
Normal Started 3m17s kubelet Started container install-cni
Normal Pulled 3m14s kubelet Container image "docker.io/calico/node:v3.17.6" already present on machine
Normal Created 3m14s kubelet Created container calico-node
Normal Started 3m14s kubelet Started container calico-node
Warning Unhealthy 3m13s kubelet Readiness probe failed: calico/node is not ready: BIRD is not ready: Error querying BIRD: unable to connect to BIRDv4 socket: dial unix /var/run/bird/bird.ctl: connect: no such file or directory
Warning Unhealthy 3m10s kubelet Readiness probe failed: calico/node is not ready: BIRD is not ready: Error querying BIRD: unable to connect to BIRDv4 socket: dial unix /var/run/calico/bird.ctl: connect: connection refused
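
One detail worth checking in the events above: the images are tagged v3.17.6, while the reported Calico version is v3.28.2. The following is a minimal sketch for pulling image tags out of kubelet event lines and comparing them with an expected version. The names `image_tags` and `EXPECTED` are illustrative assumptions, not part of any Calico tooling, and the sketch assumes every Calico image should carry the same tag.

```python
import re

# Matches "docker.io/calico/<name>:<tag>" inside a kubelet event message.
IMAGE_RE = re.compile(r"docker\.io/calico/([\w.-]+):(v[\d.]+)")

# Version reported in the cluster information (assumption: all images should match it).
EXPECTED = "v3.28.2"

def image_tags(events: str) -> dict:
    """Return {image name: tag} for every Calico image mentioned in the events text."""
    return {name: tag for name, tag in IMAGE_RE.findall(events)}

events = '''
Normal Pulled 3m19s kubelet Container image "docker.io/calico/pod2daemon-flexvol:v3.17.6" already present on machine
Normal Pulled 3m17s kubelet Container image "docker.io/calico/cni:v3.17.6" already present on machine
Normal Pulled 3m14s kubelet Container image "docker.io/calico/node:v3.17.6" already present on machine
'''

tags = image_tags(events)
# Keep only the images whose tag differs from the expected version.
mismatched = {name: tag for name, tag in tags.items() if tag != EXPECTED}
for name, tag in mismatched.items():
    print(f"calico/{name}: running {tag}, expected {EXPECTED}")
```

If the tags really differ from the installed operator version, an older manifest or cached image may still be in use; this sketch only surfaces the mismatch and does not explain it.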

4. I edited custom-resources.yaml and added the following to specify the network interface, but it did not help:

nodeAddressAutodetectionV4:
  interface: ens33

5. Port 179 is open:

lsof -i:179
COMMAND PID USER FD TYPE DEVICE SIZE/OFF NODE NAME
bird 12607 root 7u IPv4 68970 0t0 TCP *:bgp (LISTEN)
bird 12607 root 8u IPv4 76064 0t0 TCP dk8s-work1:bgp->dk8s-work2:55595 (ESTABLISHED)
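
The lsof output shows the local listener and one established session, but not whether a given port is reachable from the other node. The following is a minimal sketch for testing TCP reachability between nodes, for example BGP on port 179 or the Typha address 192.168.100.185:5473 that appears in the Felix log. `port_open` is a hypothetical helper written for illustration, not a Calico tool.

```python
import socket

def port_open(host: str, port: int, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within the timeout."""
    try:
        # create_connection resolves the host and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts, and name-resolution failures.
        return False

# Example calls, to be run on one node against the other:
#   port_open("dk8s-work2", 179)          # BGP
#   port_open("192.168.100.185", 5473)    # Typha
```

A `False` result does not distinguish between a refused connection and a dropped packet, so it is a first check rather than a full diagnosis.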

6. I have disabled the firewall on all nodes, and network connectivity between them is fine.
7. The calico-node pod log contains the following errors:

2024-11-20 05:49:25.617 [INFO][62] felix/int_dataplane.go 1501: attempted to modprobe nf_conntrack_proto_sctp error=exit status 1 output=""
2024-11-20 05:49:25.663 [ERROR][62] felix/daemon.go 491: Failed to connect to Typha. Retrying... error=dial tcp 192.168.100.185:5473: connect: connection refused
2024-11-20 05:49:30.278 [INFO][62] felix/route_table.go 1085: Failed to access interface because it doesn't exist. error=Link not found ifaceName="vxlan.calico" ifaceRegex="^vxlan.calico$" ipVersion=0x4
2024-11-20 05:49:30.278 [INFO][62] felix/route_table.go 1153: Failed to get interface; it's down/gone. error=Link not found ifaceName="vxlan.calico" ifaceRegex="^vxlan.calico$" ipVersion=0x4
2024-11-20 05:49:30.278 [ERROR][62] felix/route_table.go 920: Failed to get link attributes error=interface not present ifaceRegex="^vxlan.calico$" ipVersion=0x4
2024-11-20 05:49:30.281 [INFO][62] felix/route_table.go 1085: Failed to access interface because it doesn't exist. error=Link not found ifaceName="cali24209b127c5" ifaceRegex="^cali.*" ipVersion=0x4
2024-11-20 05:49:30.281 [INFO][62] felix/route_table.go 1153: Failed to get interface; it's down/gone. error=Link not found ifaceName="cali24209b127c5" ifaceRegex="^cali.*" ipVersion=0x4
2024-11-20 05:49:30.281 [INFO][62] felix/route_table.go 1085: Failed to access interface because it doesn't exist. error=Link not found ifaceName="cali3b132ef3344" ifaceRegex="^cali.*" ipVersion=0x4
2024-11-20 05:49:30.282 [INFO][62] felix/route_table.go 1153: Failed to get interface; it's down/gone. error=Link not found ifaceName="cali3b132ef3344" ifaceRegex="^cali.*" ipVersion=0x4
2024-11-20 05:49:30.282 [INFO][62] felix/route_table.go 1085: Failed to access interface because it doesn't exist. error=Link not found ifaceName="calid96ca193351" ifaceRegex="^cali.*" ipVersion=0x4
2024-11-20 05:49:30.282 [INFO][62] felix/route_table.go 1153: Failed to get interface; it's down/gone. error=Link not found ifaceName="calid96ca193351" ifaceRegex="^cali.*" ipVersion=0x4
2024-11-20 05:49:30.543 [INFO][62] felix/vxlan_mgr.go 425: Failed to get VXLAN tunnel device, assuming it isn't present error=Link not found
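
Most of the lines above are INFO-level; only two are logged as ERROR, and they are easier to spot when filtered. The following is a minimal sketch, assuming the line layout seen in this log (`<date> <time> [LEVEL][pid] <source file> <line>: <message>`). `errors_only` is a hypothetical helper, and the sample input is a subset of the lines above.

```python
import re

# Felix log line: "<date> <time> [LEVEL][pid] <source file> <line>: <message>"
LINE_RE = re.compile(
    r"^(?P<ts>\S+ \S+) \[(?P<level>[A-Z]+)\]\[\d+\] "
    r"(?P<src>\S+) (?P<line>\d+): (?P<msg>.*)$"
)

def errors_only(log: str) -> list:
    """Return (source file, message) for every ERROR-level line in the log text."""
    out = []
    for raw in log.splitlines():
        m = LINE_RE.match(raw.strip())
        if m and m["level"] == "ERROR":
            out.append((m["src"], m["msg"]))
    return out

log = '''\
2024-11-20 05:49:25.617 [INFO][62] felix/int_dataplane.go 1501: attempted to modprobe nf_conntrack_proto_sctp error=exit status 1 output=""
2024-11-20 05:49:25.663 [ERROR][62] felix/daemon.go 491: Failed to connect to Typha. Retrying... error=dial tcp 192.168.100.185:5473: connect: connection refused
2024-11-20 05:49:30.278 [ERROR][62] felix/route_table.go 920: Failed to get link attributes error=interface not present ifaceRegex="^vxlan.calico$" ipVersion=0x4
'''

for src, msg in errors_only(log):
    print(f"{src}: {msg}")
```

On this sample the filter leaves the Typha connection failure and the missing vxlan.calico link, which are the two entries that seem most relevant to the question.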

How can I solve this problem?