Closed jgreben closed 4 years ago
RDS (sinopia-production-rds) is being backed up. Daily snapshots done at 3AM and is kept for up to five days.
trellis EFS is being backed up nightly at 10PM. They are kept within the AWS Backup (trellis-production-efs) resource for up to 97 days before expiring.
Cognito users are also backed up to an S3 bucket (sinopia-cognito-production) and backed up every 8 hours. Manually looking at the older S3 bucket backups shows users information etc.
elasticsearch backups are automatically done hourly for version 5.3 or above. We are on version 6.5.
The following are cloudwatch monitoring that are configured to send alerts to both sinopia-dev@lists.stanford.edu and dlss-dev-alerts@lists.stanford.edu
sinopia-homepage-production-elb-response-slow sinopia-homepage-production-cpu-high-alert sinopia-homepage-production-cpu-high sinopia-homepage-production-elb-health-hosts sinopia-pe-production-elf-health-hosts sinopia-pe-production-cpu-high sinopia-pe-production-elb-response-slow sinopia-pe-production-cpu-high-alert trellis-production-elb-health-hosts trellis-production-mem-high-alert trellis-production-elb-response-slow
@kamchan Do we we monitor that the ElasticSearch cluster is green?
We should add Honeybadger.
@justinlittman The elasticsearch cluster being yellow is because there is a single node in that cluster. There is a list of monitoring (ie cpu, java mem usage, kibana search checks) that will allow us to monitor that service.
- ensure whole team is capable of doing "the stuff" for sinopia (deployments, setting up local laptops to do what is nec)
- hints on debugging problems across diff AWS containers
@ndushay I added some of your points to be included in the checklist at the top.
Following elasticsearch monitoring checks have been added to prod/stag/dev.
sinopia-es-production-healthy-nodes-alert sinopia-es-production-jvm-mem-alert sinopia-es-production-cpu-high-alert
@jgreben is this epic complete?
@jermnelson can we close this?
@michelleif - closing this issue.
What do we need to do to make sure Sinopia keeps running after the feature development cycles come to an end?