As a developer/operator of GC Notify, I would like to be able to see the performance metrics of Kubernetes pods/deployments within the Notify Kubernetes Cluster. So that we can better evaluate the load of GC Notify and support the scaling up/scaling out of the system.
WHY are we building?
When running performing tests, it is extremely useful to see how individual systems are behaving so we can better target where we need to focus our efforts on performance tuning. This is also useful for standard Notify support.
WHAT are we building?
The pod metrics are available in AWS Cloudwatch metrics right now, but there is no dashboard leveraging them. We need to create a dashboard that shows these pod metrics in a meaningful way.
VALUE created by our solution
We will be more aware of how Notify is behaving and have additional information at our disposal when troubleshooting issues.
Acceptance Criteria
[ ] Dashboard that shows the overall status of Notify in Kubernetes
[ ] Dashboard that shows performance metrics of Notify in Kubernetes
[ ] Dashboards are configured as code within notification-terraform repo
[ ] Demo the dashboards to notify team and get their feedback
QA Steps
[ ] Run a performance test against staging and validate that the metrics are behaving as expected
[ ] Check the dashboard, and confirm that the pods are reporting the correct pod metrics
Description
As a developer/operator of GC Notify, I would like to be able to see the performance metrics of Kubernetes pods/deployments within the Notify Kubernetes Cluster. So that we can better evaluate the load of GC Notify and support the scaling up/scaling out of the system.
WHY are we building?
When running performing tests, it is extremely useful to see how individual systems are behaving so we can better target where we need to focus our efforts on performance tuning. This is also useful for standard Notify support.
WHAT are we building?
The pod metrics are available in AWS Cloudwatch metrics right now, but there is no dashboard leveraging them. We need to create a dashboard that shows these pod metrics in a meaningful way.
VALUE created by our solution
We will be more aware of how Notify is behaving and have additional information at our disposal when troubleshooting issues.
Acceptance Criteria
QA Steps