FRRouting / frr

The FRRouting Protocol Suite
https://frrouting.org/

BGP ZEBRA routes not recovering after interface reappearing #6327

Open lucize opened 4 years ago

lucize commented 4 years ago

Describe the bug

Restarting a WireGuard interface does not recover the routes learned from BGP.

(put "x" in "[ ]" if you already tried following)
[x] Did you check if this is a duplicate issue?
[x] Did you test it on the latest FRRouting/frr master branch?

To Reproduce

Steps to reproduce the behavior:

  1. ip link set wg0 down && sleep 4 && ip link set wg0 up
  2. The following errors appear in the log:
netifd: Network device 'wg0' link is down
netifd: Network device 'wg0' link is up

zebra[14667]: Extended Error: Nexthop has invalid gateway
zebra[14667]: [EC 4043309093] netlink-dp (NS 0) error: Network unreachable, type=RTM_NEWNEXTHOP(104), seq=1734, pid=3243977578
zebra[14667]: Extended Error: Invalid nexthop id
zebra[14667]: [EC 4043309093] netlink-dp (NS 0) error: Invalid argument, type=RTM_NEWNEXTHOP(104), seq=1735, pid=3243977578
[...]
zebra[14667]: Extended Error: Nexthop id does not exist
zebra[14667]: [EC 4043309093] netlink-dp (NS 0) error: Invalid argument, type=RTM_NEWROUTE(24), seq=1793, pid=3243977578
zebra[14667]: [EC 4043309074] Failed to install Nexthop ID (105) into the kernel
zebra[14667]: [EC 4043309074] Failed to install Nexthop ID (104) into the kernel
zebra[14667]: default(0):10.160.13.0/24: Route install failed
zebra[14667]: default(0):10.160.23.0/24: Route install failed
zebra[14667]: default(0):10.162.22.0/24: Route install failed
zebra[14667]: default(0):10.164.17.0/24: Route install failed
zebra[14667]: default(0):10.164.27.0/24: Route install failed
zebra[14667]: default(0):10.249.119.8/30: Route install failed
[...]
zebra[14667]: default(0):10.100.10.0/24: Route install failed
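
To make it easier to see which prefixes and nexthop IDs are affected, the log above can be summarized with a small script. This is only a sketch: the regular expressions are modeled on the zebra lines quoted in this issue, and other FRR versions may word these messages differently. The function name summarize and the SAMPLE text are illustrative only.

```python
import re

# Patterns modeled on the log lines quoted in this issue (an assumption;
# other FRR versions may format these messages differently).
ROUTE_FAIL = re.compile(r"(\S+)\((\d+)(?::\d+)?\):(\S+): Route install failed")
NHG_FAIL = re.compile(r"Failed to install Nexthop ID \((\d+)\) into the kernel")


def summarize(log_text):
    """Return (failed_prefixes, failed_nexthop_ids) found in zebra log text."""
    prefixes = [m.group(3) for m in ROUTE_FAIL.finditer(log_text)]
    nhg_ids = [int(m.group(1)) for m in NHG_FAIL.finditer(log_text)]
    return prefixes, nhg_ids


# A few lines copied from the log in this report.
SAMPLE = """\
zebra[14667]: [EC 4043309074] Failed to install Nexthop ID (105) into the kernel
zebra[14667]: [EC 4043309074] Failed to install Nexthop ID (104) into the kernel
zebra[14667]: default(0):10.160.13.0/24: Route install failed
zebra[14667]: default(0):10.160.23.0/24: Route install failed
"""

failed, nhgs = summarize(SAMPLE)
print("failed prefixes:", failed)
print("failed nexthop IDs:", nhgs)
```

The failed nexthop IDs can then be compared by hand against the nexthop objects the kernel actually holds, for example with "ip nexthop show".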

Expected behavior

The routes are reinstalled.

bgpd[14822]: bgp_update_receive: rcvd End-of-RIB for IPv4 Unicast from 172.16.223.5 in vrf default

Versions

Additional context

frr.conf

router bgp 666
 bgp router-id 10.160.20.1
 bgp log-neighbor-changes
 timers bgp 8 25
 neighbor 172.16.223.5 remote-as 666
 !
 address-family ipv4 unicast
  network 10.160.20.0/24
  network 172.16.223.4/30
 exit-address-family
!

The router-id 10.160.20.1 is the IP address of eth0.

Running ip link set wg0 down && ip link set wg0 up (without the sleep) does not show the error, but the routes do not recover either.

lucize commented 4 years ago

If the interface stays down for enough seconds, the routes seem to come back as they should.
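
Since the outcome seems to depend on how long the interface stays down, a small helper could flap it with different down times to narrow down the threshold. This is a rough sketch, not a tested procedure: the helper names, the list of down times, and the settle time are all hypothetical, it only wraps the ip link commands from the reproduction steps, and it needs root.

```python
import subprocess
import time


def flap_commands(ifname):
    """Return the two ip(8) invocations used in the reproduction steps."""
    return (
        ["ip", "link", "set", ifname, "down"],
        ["ip", "link", "set", ifname, "up"],
    )


def flap(ifname, down_seconds, run=subprocess.run, sleep=time.sleep):
    """Take the interface down, wait down_seconds, and bring it back up."""
    down, up = flap_commands(ifname)
    run(down, check=True)
    sleep(down_seconds)
    run(up, check=True)


def sweep(ifname, down_times=(0, 1, 2, 4, 8, 16, 30), settle_seconds=60):
    """Flap the interface once per down time (all values are hypothetical).

    4 seconds is the delay from the original report; the settle time is an
    arbitrary pause for checking the routes by hand after each flap.
    """
    for seconds in down_times:
        print(f"flapping {ifname} with {seconds}s down time")
        flap(ifname, seconds)
        time.sleep(settle_seconds)
```

Calling sweep("wg0") as root would run the whole series; after each flap the zebra log and the kernel routing table still have to be checked manually.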

DmitryShchegolev commented 1 year ago

OSPF sometimes cannot install routes after link trouble:

Mar 8 06:54:51 r1-msk-skl zebra[829]: [HSYZM-HV7HF] Extended Error: Nexthop id does not exist
Mar 8 06:54:51 r1-msk-skl zebra[829]: [WVJCK-PPMGD][EC 4043309093] netlink-dp (NS 0) error: Invalid argument, type=RTM_NEWROUTE(24), seq=2501934, pid=4281162991
Mar 8 06:54:51 r1-msk-skl zebra[829]: [HSYZM-HV7HF] Extended Error: Nexthop id does not exist
Mar 8 06:54:51 r1-msk-skl zebra[829]: [WVJCK-PPMGD][EC 4043309093] netlink-dp (NS 0) error: Invalid argument, type=RTM_NEWROUTE(24), seq=2501935, pid=4281162991
Mar 8 06:54:51 r1-msk-skl zebra[829]: [HSYZM-HV7HF] Extended Error: Nexthop id does not exist
Mar 8 06:54:51 r1-msk-skl zebra[829]: [WVJCK-PPMGD][EC 4043309093] netlink-dp (NS 0) error: Invalid argument, type=RTM_NEWROUTE(24), seq=2501936, pid=4281162991
Mar 8 06:54:51 r1-msk-skl zebra[829]: [HSYZM-HV7HF] Extended Error: Nexthop id does not exist
Mar 8 06:54:51 r1-msk-skl zebra[829]: [WVJCK-PPMGD][EC 4043309093] netlink-dp (NS 0) error: Invalid argument, type=RTM_NEWROUTE(24), seq=2501937, pid=4281162991
Mar 8 06:54:51 r1-msk-skl zebra[829]: [HSYZM-HV7HF] Extended Error: Nexthop id does not exist
Mar 8 06:54:51 r1-msk-skl zebra[829]: [WVJCK-PPMGD][EC 4043309093] netlink-dp (NS 0) error: Invalid argument, type=RTM_NEWROUTE(24), seq=2501938, pid=4281162991
Mar 8 06:54:51 r1-msk-skl zebra[829]: [HSYZM-HV7HF] Extended Error: Nexthop id does not exist
Mar 8 06:54:51 r1-msk-skl zebra[829]: [WVJCK-PPMGD][EC 4043309093] netlink-dp (NS 0) error: Invalid argument, type=RTM_NEWROUTE(24), seq=2501939, pid=4281162991
Mar 8 06:54:51 r1-msk-skl zebra[829]: [HSYZM-HV7HF] Extended Error: Nexthop id does not exist
Mar 8 06:54:51 r1-msk-skl zebra[829]: [WVJCK-PPMGD][EC 4043309093] netlink-dp (NS 0) error: Invalid argument, type=RTM_NEWROUTE(24), seq=2501940, pid=4281162991
Mar 8 06:54:51 r1-msk-skl zebra[829]: [HSYZM-HV7HF] Extended Error: Nexthop id does not exist
Mar 8 06:54:51 r1-msk-skl zebra[829]: [WVJCK-PPMGD][EC 4043309093] netlink-dp (NS 0) error: Invalid argument, type=RTM_NEWROUTE(24), seq=2501941, pid=4281162991
Mar 8 06:54:51 r1-msk-skl zebra[829]: [HSYZM-HV7HF] Extended Error: Nexthop id does not exist
Mar 8 06:54:51 r1-msk-skl zebra[829]: [WVJCK-PPMGD][EC 4043309093] netlink-dp (NS 0) error: Invalid argument, type=RTM_NEWROUTE(24), seq=2501942, pid=4281162991
Mar 8 06:54:51 r1-msk-skl zebra[829]: [TX7F6-F3TSG] default(0:254):192.168.174.0/24: Route install failed
Mar 8 06:54:51 r1-msk-skl zebra[829]: [TX7F6-F3TSG] default(0:254):192.168.123.0/24: Route install failed
Mar 8 06:54:51 r1-msk-skl zebra[829]: [TX7F6-F3TSG] default(0:254):192.168.147.0/24: Route install failed
Mar 8 06:54:51 r1-msk-skl zebra[829]: [TX7F6-F3TSG] default(0:254):192.168.0.0/23: Route install failed
Mar 8 06:54:51 r1-msk-skl zebra[829]: [TX7F6-F3TSG] default(0:254):192.168.168.0/24: Route install failed
Mar 8 06:54:51 r1-msk-skl zebra[829]: [TX7F6-F3TSG] default(0:254):192.168.144.0/24: Route install failed
Mar 8 06:54:51 r1-msk-skl zebra[829]: [TX7F6-F3TSG] default(0:254):192.168.176.0/24: Route install failed
Mar 8 06:54:51 r1-msk-skl zebra[829]: [TX7F6-F3TSG] default(0:254):192.168.141.0/24: Route install failed

As a result, the routes shown by "show ip ospf route" in vtysh differ from the routes in the system table shown by "ip route": "show ip ospf route" lists the full set, while "ip route" contains only part of it. How can I fix this, or what can I test to gather additional information?
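
One way to gather more detail is to list exactly which prefixes OSPF knows about but the kernel does not have. The sketch below simply pulls every IPv4 prefix in CIDR form out of two saved text outputs and prints the difference. The function names are made up for illustration, and both sample outputs are abbreviated and hypothetical (the real layout of "show ip ospf route" and "ip route" may differ in detail), which is why the matching is kept deliberately loose.

```python
import ipaddress
import re

# Loose match for IPv4 prefixes in CIDR form. Host routes that "ip route"
# prints without a mask and the "default" keyword are deliberately ignored.
CIDR = re.compile(r"\b(\d{1,3}(?:\.\d{1,3}){3}/\d{1,2})\b")


def prefixes(text):
    """Collect every valid IPv4 prefix that appears anywhere in the text."""
    found = set()
    for match in CIDR.findall(text):
        try:
            found.add(ipaddress.ip_network(match, strict=False))
        except ValueError:
            continue  # looked like a prefix but is not a valid one
    return found


def missing_from_kernel(ospf_text, kernel_text):
    """Prefixes present in the OSPF output but absent from the kernel output."""
    return sorted(prefixes(ospf_text) - prefixes(kernel_text))


# Hypothetical, abbreviated outputs; the next hops and interface are made up.
OSPF_SAMPLE = """\
N    192.168.174.0/24      [20] area: 0.0.0.0
                           via 10.0.0.2, eth1
N    192.168.123.0/24      [20] area: 0.0.0.0
                           via 10.0.0.2, eth1
"""
KERNEL_SAMPLE = """\
192.168.123.0/24 via 10.0.0.2 dev eth1 proto ospf metric 20
"""

for net in missing_from_kernel(OSPF_SAMPLE, KERNEL_SAMPLE):
    print("not in kernel:", net)
```

With real data, the two inputs would be the saved output of "show ip ospf route" from vtysh and of "ip route" on the same host. Because the errors mention nexthop IDs, the output of "ip nexthop show" taken at the same moment may also be worth attaching.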

Darwin4053 commented 3 weeks ago

Is this issue still open, or has it been closed? Can you please provide the configuration?

sworleys commented 3 weeks ago

@Darwin4053 nobody is actively working on this issue as far as I know