As a developer/operator of GC Notify I only want to be alerted if an actual problem exists so that I don't get alert fatigue in our staging and production environments.
WHY are we building?
Currently we are getting several "false alarm" alerts in our staging and production slack channels. In addition to being annoying, they have actually contributed towards releasing bad code since we were unable to differentiate between alerts that were false alarms and alerts that weren't.
WHAT are we building?
We have created a list of alarms that have been triggered in the past 14 days. We will create a task for each alarm (or group of related alarms) to be investigated by GC Notify dev and core.
VALUE created by our solution
We will increase the stability of Notify, release better code, and allow developers to concentrate on their tasks due to less noise in the operations channels.
Acceptance Criteria
[ ] All alarms that have been triggered in the past 14 days have been investigated and either adjusted/fixed or documented as to why we can't reduce noise
Description
As a developer/operator of GC Notify I only want to be alerted if an actual problem exists so that I don't get alert fatigue in our staging and production environments.
WHY are we building?
Currently we are getting several "false alarm" alerts in our staging and production slack channels. In addition to being annoying, they have actually contributed towards releasing bad code since we were unable to differentiate between alerts that were false alarms and alerts that weren't.
WHAT are we building?
We have created a list of alarms that have been triggered in the past 14 days. We will create a task for each alarm (or group of related alarms) to be investigated by GC Notify dev and core.
VALUE created by our solution
We will increase the stability of Notify, release better code, and allow developers to concentrate on their tasks due to less noise in the operations channels.
Acceptance Criteria