AI-multimodal / aimmdb

BSD 3-Clause "New" or "Revised" License
0 stars 10 forks source link

Deploy Prometheus and Grafana containers #39

Open danielballan opened 2 years ago

danielballan commented 2 years ago

Today the AIMM server became unusable because of a transient database problem on Spin. (Or so it seems: we do not have thorough evidence for this.)

At NSLS2 we deploy Prometheus and Grafana to monitor availability and performance. We receive notifications (alarms) when there are outages. It would be useful to grow a historical record of aimmdb uptime and to know proactively when it is down.