filecoin-saturn / roadmap

0 stars 0 forks source link

Create a "runbook" for on call with lists of all metrics/alerts we are monitoring #22

Closed reidlw closed 11 months ago

reidlw commented 11 months ago

for lists of alerts/metrics, let's use the table Will put in the PRD: https://www.notion.so/pl-strflt/Saturn-Rhea-Observability-3ba02be6997b45d989758ca3de75ade4

probably best to convert this to Notion table and sync content across pages

let's maybe put thgis under on-call doc in notion

joaosa commented 11 months ago

This exists under here and on Will's doc. I'm thinking the actions column can be later expanded to link into specific runbooks.

Ideally, this should all be parsed from the alerts file. This way, we can even link the runbooks in the alert description.

joaosa commented 11 months ago

I'd say this is done. Here are the instructions on how to update the alerts table which is linked both here and here