ministryofjustice / operations-engineering

This repository is home to Operations Engineering's tools and utilities for managing, monitoring, and optimising software development processes at the Ministry of Justice. This repository is defined and managed in Terraform.
https://user-guide.operations-engineering.service.justice.gov.uk/
MIT License

Status Pages as a Service #186

Closed AntonyBishop closed 2 years ago

AntonyBishop commented 3 years ago

Placeholder to investigate value of offering a Status Page service.

https://mojdt.slack.com/archives/C01BUKJSZD4/p1631109709052700

User Stories - Assumed, to be tested

As a HMPPS Probation user, I need to know the status of the services we use, so that I can change usage if needed.

As a HMPPS Probation user, I need to know the status of the services we use, so that I can inform my team of changes to our expected workflow.

As a HMPPS Probation user, I need to know the status of the services we use, so that I can inform our users of possible interruptions to service levels.

As an external provider, I need to know the status of the services we interact with, so that I can change usage if needed.

As an external provider, I need to know the status of the services we interact with, so that I can monitor any required changes to our provision.

Definition of Done

AntonyBishop commented 3 years ago

Requires more user input

nimphal commented 3 years ago

After having a chat with the original requester, there isn't anything for us to do with this right now, so closing.

sldblog commented 2 years ago

👋 Hi @AntonyBishop and @Nimphal! We would like to revisit this, if possible, please.

Context: we provide a service that depends on a few shared services and is used by

Additionally, the above users will contact 1st and 2nd line support when the service is unavailable, so support staff would need to be able to see whether there is or was an incident.

These users have no visibility into what's happening in Slack.

So I think we have a strong case to build something public that acts similarly to other public status pages:


As a wish: when a dependent service is down, it would be great if we could link to its incident so that our users can track progress. But we can probably do that manually in the beginning.

AntonyBishop commented 2 years ago

Return for triage

AntonyBishop commented 2 years ago

Need to have a conversation with @ScottSeaward re research potential.

AntonyBishop commented 2 years ago

Awaiting more feedback via upcoming user survey.

AntonyBishop commented 2 years ago

Relevant discussion related to Cloud Platform - https://github.com/ministryofjustice/cloud-platform/issues/3337

AntonyBishop commented 2 years ago

Limited value. Not going to pursue. Ops Eng not best placed to solve this problem.