@edmund-dunn and I have discussed the random issues we've run into recently. It'd be great to expand our logging/metrics capture so we can correlate any odd issues we encounter with other problems in the broader system.
[ ] RDS - Slow Queries
[ ] Tugboat Disk Storage Space (Relates to #10987 and #13095)
Acceptance Criteria
[ ] We've identified a few ways in which our metrics/logging can be expanded for greater insight into the system.
[ ] We've added these to a dashboard or some other system (see #14152) for ease of access during crises.