department-of-veterans-affairs / va.gov-cms

Editor-centered management for Veteran-centered content.
https://prod.cms.va.gov
GNU General Public License v2.0

Critical External System Monitoring #5605

Open mchelen-gov opened 3 years ago

mchelen-gov commented 3 years ago

Background

The CMS integrates with multiple external systems. The operation and behavior of these systems can negatively affect CMS functionality.

User Story or Problem Statement

As a CMS engineer, I need to keep track of and monitor systems that affect CMS functionality so that I can effectively maintain operation of the CMS system.

Affected users and stakeholders

Hypothesis

We believe that documenting and monitoring external dependencies will allow the CMS team to be more aware of, and respond more effectively to, issues that affect the functionality of the CMS. We'll know that to be true when more systems that affect the CMS are monitored and the time to resolution is decreased.
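As a rough illustration of what "monitoring external dependencies" could look like, here is a minimal sketch of a check runner. It is not taken from the CMS codebase: the names `CheckResult` and `run_checks`, and the idea of registering one callable per external system, are assumptions made for this example. Each check is any function that raises on failure, so a real probe (an HTTP request, a database ping) could be plugged in later.

```python
import time
from typing import Callable, Dict, List, NamedTuple


class CheckResult(NamedTuple):
    """Outcome of probing one external system (hypothetical structure)."""
    name: str
    healthy: bool
    latency_s: float
    detail: str


def run_checks(checks: Dict[str, Callable[[], None]]) -> List[CheckResult]:
    """Run every registered check and record whether it raised.

    A check signals failure by raising any exception; returning normally
    counts as healthy. One failing check does not stop the others.
    """
    results = []
    for name, check in checks.items():
        start = time.monotonic()
        try:
            check()
            healthy, detail = True, "ok"
        except Exception as exc:  # broad on purpose: any error means unhealthy
            healthy, detail = False, f"{type(exc).__name__}: {exc}"
        results.append(CheckResult(name, healthy, time.monotonic() - start, detail))
    return results


def failing_names(results: List[CheckResult]) -> List[str]:
    """Names of systems whose check failed, in registration order."""
    return [r.name for r in results if not r.healthy]
```

Recording the latency and failure detail per system is one way to get at the "time to resolution" measure above, since it gives responders a starting point without having to reproduce the failure first.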

Assumptions

(How will these assumptions be validated?)

Acceptance Criteria

Runbook

This is particularly relevant for feature launches, or for making changes to the content model that require front end changes as well.

  1. Step_to_get_to_production_X
  2. Step_to_get_to_production_Y
  3. Step_to_get_to_production_Z

Possible tickets to create for this epic

CMS Team

Please leave only the team that will do this work selected. If you're not sure, it's fine to leave both selected.

cmaeng commented 2 years ago

@mchelen-gov this feels like ongoing maintenance of existing setups. can we close this out in favor of more quarterly timeboxed efforts?

mchelen-gov commented 1 year ago

@cmaeng The scope here was specifically around monitoring of critical external systems, which did not have monitoring implemented. It looks like the remaining issues here are around documentation, so those should get completed.