naemon / naemon-core

Networks, Applications and Event Monitor
http://www.naemon.io/
GNU General Public License v2.0

add new options to keep services running as long as they are ok #459

Closed. sni closed this 5 months ago

sni commented 5 months ago

The issue with the options:

is that reports break because hosts/services suddenly stop executing and keep their OK state, which makes those options pretty unusable.

So to keep reporting correct, services need to keep running even if the host is down. With these new options, hosts/services keep running as long as they are up. As soon as a service goes down, it stops running until the host comes back up. That way naemon has to run fewer checks, especially fewer checks that run into timeouts and the like, while reporting stays correct.
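To illustrate the intended behavior, here is a rough Python sketch of the decision for a single service. It is not the actual naemon-core code (which is C). The function name, its arguments, and the constants are made up for illustration; only the option names and the meaning of -2 come from this description.

```python
# Hypothetical sketch of the check decision described above.
# Not naemon-core code; names other than the two options are invented.

STATE_OK = 0                 # service state considered "ok"
KEEP_RUNNING_WHILE_OK = -2   # the new option value introduced by this PR


def should_run_service_check(host_is_up: bool,
                             service_last_state: int,
                             host_down_disable_service_checks: bool,
                             service_skip_check_host_down_status: int) -> bool:
    """Return True if the service check should still be executed."""
    if host_is_up:
        # Host is fine, nothing to skip.
        return True
    if not host_down_disable_service_checks:
        # Skipping on host down is not enabled at all.
        return True
    if service_skip_check_host_down_status == KEEP_RUNNING_WHILE_OK:
        # New behavior: keep checking while the service is still OK,
        # stop once it has left the OK state.
        return service_last_state == STATE_OK
    # Any other value: the check is skipped while the host is down.
    return False
```

With the host down and the option set to -2, a service that is still OK keeps being checked, so the report picks up the moment it actually fails. Once it has failed, the check is skipped until the host is back up.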

The option service_skip_check_dependency_status=-2 will also be used for service parents.

This PR also adds a new option, service_parents_disable_service_checks, which prevents service checks from running if service parents are down.

Recommended settings are:

host_down_disable_service_checks=1       ; disable service checks if host is down
service_parents_disable_service_checks=1 ; also disable service checks if parents are down
service_skip_check_host_down_status=-2   ; but keep running as long as they are ok
service_skip_check_dependency_status=-2  ; same, but for dependency checks.
host_skip_check_dependency_status=-2     ; and for host checks.

sni commented 5 months ago

The PR is ready and has been running here for 2 days without issues.