Open jimleroyer opened 11 months ago
Waiting on team lead to organize polling and maybe another meeting with the SRE/Core team to have their takes.
We had a meeting today and following conclusions came, along with a few action items:
Next action is waiting on team leader for chiming in with Platform Core team, and director to speak with management on compensation.
Jimmy met with Ioana, we might be blocked on this due to lack of funding approval...
Jimmy to ping Mohamed and Ioana about this. Must likely this specific card will get rescoped to a mechanism to reach out to others.
Jimmy asking today!
Jimmy will document process.
I didn't work on this today but I ask our favorite AI to generate documentation on this. I will review tomorrow, correct if necessary and put this in a Google document.
Monitor Alert Status:
Handle Acknowledgments and Resolutions:
Well the AI description needed some tweaking but it wasn't far off surprisingly.
The new documentation ready to be reviewed: https://docs.google.com/document/d/1FsGzDwWdZ51B6AHdCK2W4NA-vHAGmi6f7xmv5I2f0Lc/
Ben, Yael and Jumana gave their approval on the document after review. Closing the ticket as it has been through review.
Description
As an incident command or ops lead, I want a second line of defense on support, So that I can ping extra help if I become unavailable or needs an extra help if support 🔥 is ravaging.
WHY are we building?
More reliability on support. We are running a 24/7 service and needs to ensure we are always able to have our IC and OL roles fulfilled.
WHAT are we building?
Potentially a second team in opsgenie that would be inserted into the escalation policies, notified after the main support team.
VALUE created by our solution
Reliability and resilience in our support.
Acceptance Criteria
QA Steps