canonical / charm-openstack-service-checks

Collection of Nagios checks and other utilities that can be used to verify the operation of an OpenStack cluster

separate warning and crit errors of agents that are disabled|down #104

Open sudeephb opened 7 months ago

sudeephb commented 7 months ago

It's very hard to tell which hosts are actually in trouble when the disabled hosts are mixed in with the down hosts in a single message.

Sample:

CRITICAL: nova-compute, Host AZ1-Compute-1 disabled, Host AZ1-Compute-2 disabled, Host AZ2-Compute-1 disabled, Host AZ2-Compute-5 disabled, AZ1-Compute-14 down, Host AZ2-Compute-2 disabled, Host AZ1-Compute-6 disabled, Host AZ1-Compute-11 disabled, AZ1-Compute-13 down, Host AZ1-Compute-5 disabled, AZ2-Compute-8 down, Host AZ2-Compute-5 disabled, Host AZ1-Compute-2 disabled, Host AZ2-Compute-2 disabled, Host AZ1-Compute-1 disabled, Host AZ1-Compute-6 disabled, Host AZ1-Compute-11 disabled, Host AZ1-Compute-5 disabled, AZ1-Compute-14 down, AZ1-Compute-13 down

Improvement suggested:

CRITICAL: nova-compute Hosts disabled: AZ1-Compute-1, AZ1-Compute-2, AZ2-Compute-1, AZ2-Compute-5, AZ2-Compute-2, AZ1-Compute-6, AZ1-Compute-11, AZ1-Compute-5, AZ2-Compute-5, AZ1-Compute-2, AZ2-Compute-2, AZ1-Compute-1, AZ1-Compute-6, AZ1-Compute-11, AZ1-Compute-5 Hosts down: AZ1-Compute-14, AZ1-Compute-13, AZ2-Compute-8, AZ1-Compute-14, AZ1-Compute-13
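A rough sketch of how the grouping could work, in case it helps the discussion. This is not the charm's actual code: `format_grouped_message`, its parameters, and the `(host, state)` input shape are all hypothetical, and it assumes the check has already collected the state of each agent. Duplicate hosts are kept as-is, matching the suggested output.

```python
from collections import defaultdict


def format_grouped_message(service, entries, status="CRITICAL"):
    """Group (host, state) pairs by state and build a one-line message.

    Hypothetical helper for illustration only, not taken from the charm.
    """
    # dicts preserve insertion order, so states appear in the order
    # they are first seen in the input.
    hosts_by_state = defaultdict(list)
    for host, state in entries:
        hosts_by_state[state].append(host)

    parts = [
        "Hosts {}: {}".format(state, ", ".join(hosts))
        for state, hosts in hosts_by_state.items()
    ]
    return "{}: {} {}".format(status, service, " ".join(parts))


# Assumed input shape: one (host, state) pair per reported agent.
sample_entries = [
    ("AZ1-Compute-1", "disabled"),
    ("AZ1-Compute-14", "down"),
    ("AZ1-Compute-2", "disabled"),
]
grouped_message = format_grouped_message("nova-compute", sample_entries)
print(grouped_message)
```

With the three sample entries this prints the disabled hosts together, followed by the down hosts, instead of interleaving them.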


Imported from Launchpad using lp2gh.

sudeephb commented 7 months ago

(by guoqiao) Maybe we can go even further and show them in a table, using a Nagios multi-line message.
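A minimal sketch of what the table idea might look like, assuming the usual Nagios plugin convention that the first output line is the summary and any following lines are shown as long output. The function name, the column headers, and the `(host, state)` input shape are made up for illustration and are not part of the charm.

```python
def format_table_message(service, entries, status="CRITICAL"):
    """Build a summary line followed by one aligned table row per host.

    Hypothetical helper for illustration only, not taken from the charm.
    """
    down = sum(1 for _, state in entries if state == "down")
    disabled = sum(1 for _, state in entries if state == "disabled")
    summary = "{}: {}, {} down, {} disabled".format(
        status, service, down, disabled
    )

    # Pad the host column to the widest host name (or the header itself).
    width = max([len("HOST")] + [len(host) for host, _ in entries])
    lines = [summary, "HOST".ljust(width) + "  STATE"]
    for host, state in entries:
        lines.append(host.ljust(width) + "  " + state)
    return "\n".join(lines)


# Assumed input shape: one (host, state) pair per reported agent.
table_entries = [
    ("AZ1-Compute-1", "disabled"),
    ("AZ1-Compute-14", "down"),
]
table_message = format_table_message("nova-compute", table_entries)
print(table_message)
```

The summary line stays short enough for the Nagios overview, while the per-host rows only show up in the detailed view.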