telekom / das-schiff-network-operator

Configure netlink interfaces, simple eBPF filters and FRR using Kubernetes resources.
Apache License 2.0
28 stars 2 forks source link

[Feature] Recover from FRR reload failures #62

Closed MaxRink closed 11 months ago

MaxRink commented 1 year ago

If FRR dies during a reload network operator should detect this and then try to restart FRR. Currently NWOP does not recover frr if it fails and thus manual interaction

chdxD1 commented 11 months ago

If the reload systemd job has the status failed (we now wait for the response) a restart is tried. This might take some minutes until the job times out.