edx / edx-arch-experiments

A plugin to include applications under development by the architecture team at edx
GNU Affero General Public License v3.0

Migrate Arch-BOM dashboards #661

Closed by dianakhuang 5 days ago

dianakhuang commented 1 month ago

Acceptance Criteria

Migrate the following dashboards:

dianakhuang commented 1 month ago

DD missing code owner dashboard: https://app.datadoghq.com/dashboard/c9q-24y-fbe/lms---missing-codeowner

dianakhuang commented 1 month ago

Safe Sessions User Mismatches: https://app.datadoghq.com/dashboard/drj-zgi-yrg/safe-session-user-mismatches

dianakhuang commented 1 month ago

Codejail error breakdown: https://app.datadoghq.com/dashboard/qye-jtx-kjr/codejail-error-breakdown-edx

dianakhuang commented 3 weeks ago

All arch-bom dashboards: https://app.datadoghq.com/dashboard/lists?q=team%3Aarch-bom

timmc-edx commented 2 weeks ago

@dianakhuang I reviewed these, and I think https://app.datadoghq.com/dashboard/m3b-uvv-9w4 is the only one remaining for review (LMS alert diagnosis dashboard).

dianakhuang commented 2 weeks ago

@timmc-edx The dashboard looks good to me. I think the documentation/runbook needs to be updated to help people diagnose errors, but that's partially on me as well.

timmc-edx commented 2 weeks ago

I converted the last of the New Relic links in the runbook (https://2u-internal.atlassian.net/wiki/spaces/ENG/pages/1083605119/Squad-owned+LMS+Datadog+alert+runbook), although I think the on-dashboard docs are going to be the most helpful.

dianakhuang commented 2 weeks ago

I was thinking of the docs that say "See further directions in alert runbook on how to get further information about the errors." I want to make sure the runbook page actually has docs on how to get more info.