Open fgalan opened 2 years ago
Hi @fgalan sir,
I would like to work on this issue. As per my understanding, we need to add a "NotificationQueue" alarm
that is raised when the notification queue overpasses the threshold.
How many thresholds? For instance, >80% is critical, 50-80% is moderate. However, the current raise-release mechanism gets complicated if more than one level is defined, so probably the simpler approach is just one threshold.
We need to specify only one threshold value. For that, we can either hardwire the threshold value in the Orion code or add a CLI option for it.
Please confirm my understanding.
I think your understanding is correct. Thanks!
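To make the single-threshold raise-release behaviour discussed above concrete, here is a minimal sketch. All names in it (`QueueThresholdAlarm`, `AlarmEvent`, `update`) are hypothetical and are not taken from the Orion code base. It also assumes the threshold is expressed as a percentage of queue capacity, which the thread does not settle. It only reports the state transition, so the actual log trace format is left open.

```cpp
#include <cstddef>
#include <map>
#include <string>

// Hypothetical names throughout; none of this is taken from the Orion code base.
enum class AlarmEvent { None, Raise, Release };

class QueueThresholdAlarm {
 public:
  // thresholdPercent: the single alarm level (assumption: a percentage of
  // the queue capacity, e.g. 80 means "more than 80% full").
  explicit QueueThresholdAlarm(unsigned thresholdPercent)
      : thresholdPercent_(thresholdPercent) {}

  // Report the current occupancy of the queue for `service`
  // ("default" for the default queue). Raise or Release is returned only
  // on a state change, so a trace would be printed once per transition.
  AlarmEvent update(const std::string& service, std::size_t size,
                    std::size_t capacity) {
    const bool over =
        capacity > 0 && size * 100 > capacity * thresholdPercent_;
    bool& raised = raised_[service];  // false the first time a service is seen
    if (over && !raised) {
      raised = true;
      return AlarmEvent::Raise;
    }
    if (!over && raised) {
      raised = false;
      return AlarmEvent::Release;
    }
    return AlarmEvent::None;
  }

 private:
  unsigned thresholdPercent_;
  std::map<std::string, bool> raised_;  // per-service alarm state
};
```

With a single level, one boolean per service is enough state. Supporting several levels (for instance, moderate and critical) would need a level per service plus rules for moving between levels, which is the complication mentioned above.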
Is your feature request related to a problem / use case? Please describe.
Orion logs when a notification queue is full (in the threadpool notification model), either for the general queue or for per-service queues, if that functionality is in use.
Thus, operation teams learn that the queue is getting saturated only when it is already too late and notifications are being dropped.
Describe the solution you'd like
Implement a new alarm, this way:
Raise alarm: "... <service>: <detail>". Orion prints this trace when the notification queue of <service> ("default" for the default queue) has overpassed the alarm threshold. The <detail> text describes the particular threshold.
Release alarm: "... <service>", where <service> is the same one that triggered the alarm. Orion prints this trace when the notification queue goes back below the threshold.
Things to decide:
Describe alternatives you've considered
None so far
Describe why you need this feature
It would be useful for the operation teams using Orion, so they can define alarms based on Orion logs.