kiwix / operations

Kiwix Kubernetes Cluster
http://charts.k8s.kiwix.org/

Better grafana to Slack alerts #153

Open rgaudin opened 6 months ago

rgaudin commented 6 months ago

With Grafana, we now have access to alerts on HTTP 50x errors (one for all 50x and one for 500) on all our containers. It's a good thing, but the way we get those alerts is not ideal:

The most annoying part to me is the resolve message, which creates noise. Having to switch context to go to Slack, select the channel and click on the message to find out the namespace (and infer the culprit container) may look like nothing to care about, but it's actually a huge blocker to reacting to issues.

UptimeRobot alerts are very clear and useful in comparison. From the notification preview, I already know what's going on and can decide whether it requires immediate action or not.

With Grafana, I can't, so I mostly decide that I'll look at it when I get a chance…
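To make the request concrete, here is a minimal sketch of the kind of message that would be readable from the notification preview alone. It assumes the alerts arrive as an Alertmanager-style webhook payload (a top-level `alerts` list whose items carry `status` and `labels`), and that each alert has `namespace` and `container` labels. Those label names, the function names and the sample values are assumptions for illustration, not our actual configuration.

```python
from typing import Any, Optional


def preview_lines(payload: dict[str, Any]) -> list[str]:
    """Return one short line per firing alert; resolved alerts are skipped."""
    lines = []
    for alert in payload.get("alerts", []):
        if alert.get("status") != "firing":
            continue  # skip "resolved" notifications, which only add noise
        labels = alert.get("labels", {})
        name = labels.get("alertname", "alert")
        # "namespace" and "container" are assumed label names
        namespace = labels.get("namespace", "unknown-namespace")
        container = labels.get("container", "unknown-container")
        lines.append(f"[FIRING] {name}: {namespace}/{container}")
    return lines


def slack_message(payload: dict[str, Any]) -> Optional[dict[str, str]]:
    """Build a Slack incoming-webhook body, or None if nothing is firing."""
    lines = preview_lines(payload)
    return {"text": "\n".join(lines)} if lines else None


# Made-up sample payload: one firing alert and one resolved alert.
example = {
    "alerts": [
        {
            "status": "firing",
            "labels": {
                "alertname": "HTTP 500",
                "namespace": "example-namespace",
                "container": "example-container",
            },
        },
        {
            "status": "resolved",
            "labels": {"alertname": "HTTP 50x", "namespace": "other-namespace"},
        },
    ]
}

print(slack_message(example))
```

With this shape, the first line of the notification already names the namespace and container, and a payload that only contains resolved alerts produces no message at all.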

[Screenshots: 2023-12-16 at 14 30 24, 14 30 41, 14 30 51]

benoit74 commented 6 months ago

I can only agree 👍