Closed: dianakhuang closed this 5 days ago
DD missing code owner dashboard: https://app.datadoghq.com/dashboard/c9q-24y-fbe/lms---missing-codeowner
Safe Sessions User Mismatches: https://app.datadoghq.com/dashboard/drj-zgi-yrg/safe-session-user-mismatches
Codejail error breakdown: https://app.datadoghq.com/dashboard/qye-jtx-kjr/codejail-error-breakdown-edx
All arch-bom dashboards: https://app.datadoghq.com/dashboard/lists?q=team%3Aarch-bom
@dianakhuang I reviewed these, and I think https://app.datadoghq.com/dashboard/m3b-uvv-9w4 is the only one remaining for review (LMS alert diagnosis dashboard).
@timmc-edx dashboard looks good to me. I think the documentation/runbook needs to be updated to help people diagnose errors, but I think that's partially on me as well.
I converted the last of the New Relic links in https://2u-internal.atlassian.net/wiki/spaces/ENG/pages/1083605119/Squad-owned+LMS+Datadog+alert+runbook, although I think the on-dashboard docs are going to be the most helpful.
I was thinking of the docs that say "See further directions in alert runbook on how to get further information about the errors." and making sure the runbook page actually has docs on how to get more info.
Acceptance Criteria
Migrate the following dashboards:
Event Bus Kafka overview (Moved to Kafka ticket.)