freifunkh / ansible

Here we store all Ansible roles and configs used for Freifunk Hannover.
MIT License
7 stars 3 forks source link

bird: no route to host, hold-timer expires #227

Closed 1977er closed 2 years ago

1977er commented 2 years ago

Since (around) 01/2022 bird is flapping ipv6 bgp sessions (both cix1 and cix2).

Logs show alot of clevertor bird[750]: Kernel dropped some netlink messages, will resync on next scan. and clevertor bird[750]: Netlink: No route to host before finally clevertor bird[750]: cix1_ip6: Error: Hold timer expired

In the aftermath routes from the ipv6 table get deleted (assumption) and ipv6 connectivity on some host (sn02,...) go south (fact).

1977er commented 2 years ago

Trying to increase net.ipv6.route.max_size according to https://www.mail-archive.com/bird-users@network.cz/msg05945.html.

Setting /etc/sysctl.d/11-ipv6-routing.conf manually.

1977er commented 2 years ago

Doesn't help against Kernel dropped some netlink messages, will resync on next scan messages.

1977er commented 2 years ago

Monitoring route to bgp endpoints via Zabbix. grafik

1977er commented 2 years ago

Adjusting net.ipv6.route.max_size might have helped against No route to host. No hold-timer issues since then. Will keep an eye on it anyway.

1977er commented 2 years ago

I guess the sysctl setting did the trick. No more No route to host for three days now. I declare the patient cured.

Moving the zabbix debug items into the template, but disabling them.