department-of-veterans-affairs / va.gov-cms

Editor-centered management for Veteran-centered content.
https://prod.cms.va.gov
GNU General Public License v2.0

3/10 Prod Deployment Failure #17508

Open gracekretschmer-metrostar opened 3 months ago

gracekretschmer-metrostar commented 3 months ago

User Story or Problem Statement

As a CMS tech lead, how can I best prevent this prod deployment failure from happening again?

Identify how staging handles clearing Drupal's cache.
The rollback procedure is manual; it would be nice to have something more automated.

Description or Additional Context

On 3/10, the deployment to prod failed. After investigating, the CMS team discovered that a Dependabot bump caused the failure. Once the team reverted the Dependabot bump, the error was resolved and prod deployed successfully.

Relevant Links

Hypothesis

The team hypothesizes that the following two solutions would prevent this situation in the future:

In addition, if we could broaden our testing then we could catch these issues earlier in the pipeline.

Acceptance Criteria

Team

Please check the team(s) that will do this work.

7hunderbird commented 2 months ago

@gracekretschmer-metrostar I'm just taking a fresh look at tickets that I'm assigned.

When I look at this ticket, I can use the Slack link to research what happened, but the context has gone a bit stale because it's been a while.

I guess I wanted to get it back on your radar and perhaps work together on next steps.