NagiosEnterprises / nagioscore

Nagios Core
GNU General Public License v2.0
1.56k stars 449 forks source link

Notification interval back-off #616

Open X01XX opened 5 years ago

X01XX commented 5 years ago

Hello, Nagios XI support recommended I open this.

To avoid having a large number of alerts during the weekend or other time off, it has been suggested at my site to logarithmically increase the notification interval.

If the second alert is after 30 minutes, the third is 60 minutes, then 120 minutes.

ericloyd commented 5 years ago

While you can accomplish this today with escalations, I get what you're saying (though it's a geometric sequence, not a logarithmic one, but that's splitting hairs).

I think this could easily be implemented with some sort of "notification_interval_multiplier" parameter (defaults to "1") that indicates how many notification_intervals should be skipped before notifying again, keeping in mind which notification number this is.

The logic would be:

next_notification = notification_interval * notification_interval_multiplier ^ notification_number

X01XX commented 5 years ago

Hello Eric,

Any form of increase, logarithmic or geometric, will do.

I think what you suggest would work.

There is an OLD OLD event monitor called BigBrother, which someone renamed to WatchDog for our site, that did that for every alert, and people liked it.

Thanks for the quick response!

Earl

From: Eric Loyd [mailto:notifications@github.com] Sent: Tuesday, January 29, 2019 4:16 PM To: NagiosEnterprises/nagioscore nagioscore@noreply.github.com Cc: Earl Dukerschein earl.dukerschein@wisc.edu; Author author@noreply.github.com Subject: Re: [NagiosEnterprises/nagioscore] Notification interval back-off (#616)

While you can accomplish this today with escalations, I get what you're saying (though it's a geometric sequence, not a logarithmic one, but that's splitting hairs).

I think this could easily be implemented with some sort of "notification_interval_multiplier" parameter (defaults to "1") that indicates how many notification_intervals should be skipped before notifying again, keeping in mind which notification number this is.

The logic would be:

next_notification = notification_interval * notification_interval_multiplier ^ notification_number

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHubhttps://github.com/NagiosEnterprises/nagioscore/issues/616#issuecomment-458729336, or mute the threadhttps://github.com/notifications/unsubscribe-auth/As-bk8c4KdH0VSjSA_h3ikdPHP6w8hKIks5vIMgpgaJpZM4aY95a.

ericloyd commented 5 years ago

We implemented BigBrother for the team I managed a long time ago at an ISP, back when 56K modems were all the rage. It was really cool seeing all the little red/green lights for a few thousand modem lines as they were being used. :-)

I still think the idea is a good one, but any time adding/changing Nagios configuration parameters comes into play, it's a "Big Deal." I'm sure Jake will take care of it, though. :-)

jomann09 commented 5 years ago

This is a good idea, I will have to take a look into it to see how feasible it actually is to do, but I don't think it will actually be too bad.

vladki77 commented 1 week ago

Yay, that would be nice feature.