Right now if we API will return -1 for a bunch of metrics when errors are thrown by the CouchDb queries to retrieve required values. However, when this happened for a production instance, date.uptime and date.current were correctly incrementing, so Watchdog didn't think anything was wrong.
Instead, we should fire an alert to indicate that something is wrong.
In the outage itself, the monitoring API returned this:
Right now if we API will return -1 for a bunch of metrics when errors are thrown by the CouchDb queries to retrieve required values. However, when this happened for a production instance,
date.uptime
anddate.current
were correctly incrementing, so Watchdog didn't think anything was wrong.Instead, we should fire an alert to indicate that something is wrong.
In the outage itself, the monitoring API returned this:
which in turn looked like
0
value for everything instead of-1