Ensure that Vets API RDS alerts and monitors are fully operational and configured to notify the appropriate teams about potential system capacity, latency, and related issues.
Goals
[ ] Verify existing RDS monitors and alerts for system storage capacity.
[ ] Set up notifications to alert relevant parties before storage thresholds are reached.
[ ] Review and update RDS monitoring thresholds and alerts if necessary.
Background
We expect a high volume of submissions to interact with our database systems, so it is critical to monitor our RDS instances and prevent crashes caused by exhausted storage capacity. Proactive monitoring will help mitigate the risk of downtime or data loss.
Action Items
[ ] Conduct an audit of current RDS alerting mechanisms for system storage capacity.
[ ] Update alert thresholds to ensure early warning ahead of potential system capacity issues.
[ ] Ensure that alerts are configured to notify the appropriate teams via Slack and PagerDuty.
[ ] Document the RDS monitoring process and alerting protocols.
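As a starting point for the threshold audit, the sketch below shows one way to express tiered storage thresholds and the notification channels each tier should reach. Everything in it is an assumption made for illustration: the 20% and 10% free-space cutoffs, the tier names, and the `classify_free_storage` helper are hypothetical and do not reflect the current Vets API configuration. It assumes free space is reported in bytes and allocated storage in GiB, which is how RDS reports `FreeStorageSpace` and `AllocatedStorage`.

```python
from dataclasses import dataclass

GIB = 1024 ** 3  # bytes per GiB


@dataclass(frozen=True)
class StorageTier:
    name: str
    min_free_fraction: float  # tier fires when free space is at or below this fraction
    channels: tuple           # hypothetical channel labels, not real integrations


# Hypothetical tiers: the cutoffs are placeholders to be replaced after the audit.
TIERS = (
    StorageTier("warning", 0.20, ("slack",)),
    StorageTier("critical", 0.10, ("slack", "pagerduty")),
)


def classify_free_storage(free_bytes, allocated_gib, tiers=TIERS):
    """Return the most severe tier matched by the current free space, or None."""
    if allocated_gib <= 0:
        raise ValueError("allocated_gib must be positive")
    free_fraction = free_bytes / (allocated_gib * GIB)
    # Check the lowest cutoff first so the most severe matching tier wins.
    for tier in sorted(tiers, key=lambda t: t.min_free_fraction):
        if free_fraction <= tier.min_free_fraction:
            return tier
    return None


if __name__ == "__main__":
    tier = classify_free_storage(free_bytes=15 * GIB, allocated_gib=100)
    print(tier.name if tier else "ok", tier.channels if tier else ())
```

Keeping the warning tier on Slack only and paging at the critical tier is one possible split; the actual routing should follow whatever the audit and the on-call teams agree on.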
Importance of RDS Monitoring
Effective RDS monitoring is vital to:
Prevent service interruptions due to storage capacity issues.
Maintain data integrity and availability.
Ensure that performance remains optimal even as system load increases.
Allow for timely and informed decision-making in the event of potential system constraints.
Proactive alerts let us address issues before they escalate, which supports our commitment to service reliability and performance.
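To make "before they escalate" concrete, a rough projection of when storage will run out can complement fixed thresholds. The sketch below fits a straight line to recent free-space samples and estimates the days remaining. It is an assumption-laden illustration: the `days_until_full` helper and the sample format are hypothetical, and linear growth is a simplification that real submission traffic may not follow.

```python
def days_until_full(samples):
    """Estimate days until free storage reaches zero.

    samples: iterable of (day, free_bytes) pairs, where day is a number
    (for example, days since the first measurement).
    Returns None when free space is flat or growing.
    """
    samples = list(samples)
    n = len(samples)
    if n < 2:
        raise ValueError("need at least two samples")

    mean_day = sum(day for day, _ in samples) / n
    mean_free = sum(free for _, free in samples) / n
    var_day = sum((day - mean_day) ** 2 for day, _ in samples)
    if var_day == 0:
        raise ValueError("samples must span more than one point in time")

    # Least-squares slope: change in free bytes per day.
    slope = sum((day - mean_day) * (free - mean_free) for day, free in samples) / var_day
    if slope >= 0:
        return None  # storage is not shrinking, so no exhaustion date

    intercept = mean_free - slope * mean_day
    exhaustion_day = -intercept / slope
    latest_day = max(day for day, _ in samples)
    return max(0.0, exhaustion_day - latest_day)


if __name__ == "__main__":
    # Hypothetical readings: free space drops by 10 units per day.
    print(days_until_full([(0, 100.0), (1, 90.0), (2, 80.0)]))
```

A projection like this could feed an early-warning alert (for example, "fewer than N days of storage left"), with N chosen by the team based on how long a storage increase takes to schedule.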
Expected Outcomes
A robust set of RDS alerts and monitors tailored to our system's needs.
Clear documentation and communication protocols for responding to RDS capacity alerts.
Enhanced system reliability and uptime, with safeguards against capacity-related incidents.