CDCgov / trusted-intermediary

Bringing together healthcare providers by reducing the connection burden.
Apache License 2.0
11 stars 5 forks source link

Azure Alerts for Errors - CDC TI services are down/offline #1398

Open scleary1cs opened 2 weeks ago

scleary1cs commented 2 weeks ago

Story

As a developer, I need to know if our services are down/offline, so that we can begin an incident.

Pre-conditions

Acceptance Criteria

Tasks

Engineering

Definition of Done

Research Questions

Decisions

Notes

pluckyswan commented 1 day ago

I believe this is covered by #1397

JohnNKing commented 15 hours ago

Already complete?

somesylvie commented 15 hours ago

1397 is about whether Azure services are down (like if e.g. all web applications are down or all of blob storage is unavailable). This card is about whether our team's services (TI and SFTP Ingestion Service) are down, which can happen even if Azure is fine. This card is about resource health in Azure parlance, while 1397 is about service health and Azure status