Closed nateborr closed 1 month ago
All modified and coverable lines are covered by tests :white_check_mark:
Project coverage is 94.18%. Comparing base (
b7ed7c5
) to head (2e2be1e
).
:umbrella: View full report in Codecov by Sentry.
:loudspeaker: Have feedback on the report? Share it here.
Description
Update the developer documentation for egress proxy setup and Tock deployment. This documents the egress proxy updates we've just applied in staging to restore New Relic traffic.
Also reduce the New Relic agent log level back to
info
.Addresses #1792 .
Testing
Once deployed, SSH into the tock-staging container and confirm that the New Relic agent is only logging at the info level and above:
Production deployment
With this update, we are clear to deploy the changes that have accumulated in staging over several weeks, including these latest changes for New Relic data.
That will consist of three steps:
gov-collector.newrelic.com
domain the the egress proxy's allow list. This reflects the egress config that I've validated in staging.production-egress
on port 8080.Step 3 is a clean-up step that I've applied in staging; it closes an unused path for external network traffic and simplifies the configuration for future maintenance.
Avoiding downtime
Our unexpected downtime in staging when we applied the New Relic egress related changes was due to a few factors:
run.sh
script that caused application start-up to fail if the New Relic admin tool failed to record the deploymentIssue 2 was resolved by #1807 (although that fix won't be in effect until the release is deployed) and the other two issues will be mitigated by following the deployment steps in the order above with a pair / co-pilot, and with manual smoke-testing of Tock between each step.