canonical / charm-openstack-service-checks

Collection of Nagios checks and other utilities that can be used to verify the operation of an OpenStack cluster

Monitoring of DVR-related fip/qrouter namespace veth pairs #126

Open sudeephb opened 6 months ago

sudeephb commented 6 months ago

In response to a recent customer issue, it's been suggested that we add some sort of check that allows us to detect when DVR routing is broken. Specifically, this is in response to this bug: https://bugs.launchpad.net/neutron/+bug/1894843

When this bug occurs, the veth pairs between the fip and qrouter namespaces may be inadvertently removed. The fip namespace can then no longer forward traffic to the qrouter namespace, so floating IPs effectively break for instances on hosts where this occurs.

It's been suggested that we add some sort of alert so we can see if the veth links between the qrouter and fip namespaces are missing, so as to indicate a problem with DVR health.
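
As a rough illustration of what such an alert could look like, here is a minimal sketch in Python. It is not an existing check in this charm. It assumes that the qrouter end of the veth pair is named with an `rfp-` prefix and the fip end with an `fpr-` prefix, and that every qrouter namespace on a host with a fip namespace should have one (routers without an external gateway would need to be excluded in practice). The function names are hypothetical.

```python
import subprocess


def parse_netns(output):
    """Return namespace names from `ip netns list` output."""
    # Lines look like "qrouter-<uuid> (id: 3)" or just "qrouter-<uuid>".
    return [line.split()[0] for line in output.splitlines() if line.strip()]


def parse_links(output):
    """Return interface names from `ip -o link show` output."""
    names = []
    for line in output.splitlines():
        parts = line.split(": ", 2)
        if len(parts) >= 2:
            # veth ends are printed as "name@peer"; keep only the name.
            names.append(parts[1].split("@")[0])
    return names


def routers_missing_veth(links_by_ns):
    """Return qrouter namespaces that have no rfp- device.

    links_by_ns maps a namespace name to its list of interface names.
    Assumption: with no fip namespace on the host there is nothing to check.
    """
    if not any(ns.startswith("fip-") for ns in links_by_ns):
        return []
    return sorted(
        ns
        for ns, links in links_by_ns.items()
        if ns.startswith("qrouter-")
        and not any(name.startswith("rfp-") for name in links)
    )


def collect_links():
    """Gather interface names per namespace on this host (needs root)."""
    netns = subprocess.check_output(["ip", "netns", "list"], text=True)
    result = {}
    for ns in parse_netns(netns):
        out = subprocess.check_output(
            ["ip", "netns", "exec", ns, "ip", "-o", "link", "show"], text=True
        )
        result[ns] = parse_links(out)
    return result


def main():
    """Print a Nagios-style status line and return the plugin exit code."""
    missing = routers_missing_veth(collect_links())
    if missing:
        print("CRITICAL: no rfp- veth in: " + ", ".join(missing))
        return 2
    print("OK: all qrouter namespaces have an rfp- veth")
    return 0
```

A wrapper script would call `sys.exit(main())` on each node. The parsing and decision logic are kept separate from the `ip` calls so they can be unit tested without root or real namespaces.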


Imported from Launchpad using lp2gh.

sudeephb commented 6 months ago

(by vultaire) Added the neutron-openvswitch charm. This type of check would probably need to run directly on each node that could host such qrouter/fip namespaces, so that charm seems more appropriate.