LD4P / sinopia

LD4P Sinopia Project repo to hold docs, general issues, schemas, and related spec docs.
https://ld4p.github.io/sinopia/
19 stars 3 forks source link

SC #11 Monitoring and Maintenance #269

Closed jgreben closed 4 years ago

jgreben commented 4 years ago

What do we need to do to make sure Sinopia keeps running after the feature development cycles come to an end?

kamchan commented 4 years ago

The following are cloudwatch monitoring that are configured to send alerts to both sinopia-dev@lists.stanford.edu and dlss-dev-alerts@lists.stanford.edu

sinopia-homepage-production-elb-response-slow sinopia-homepage-production-cpu-high-alert sinopia-homepage-production-cpu-high sinopia-homepage-production-elb-health-hosts sinopia-pe-production-elf-health-hosts sinopia-pe-production-cpu-high sinopia-pe-production-elb-response-slow sinopia-pe-production-cpu-high-alert trellis-production-elb-health-hosts trellis-production-mem-high-alert trellis-production-elb-response-slow

justinlittman commented 4 years ago

@kamchan Do we we monitor that the ElasticSearch cluster is green?

justinlittman commented 4 years ago

We should add Honeybadger.

ndushay commented 4 years ago
ndushay commented 4 years ago
kamchan commented 4 years ago

@justinlittman The elasticsearch cluster being yellow is because there is a single node in that cluster. There is a list of monitoring (ie cpu, java mem usage, kibana search checks) that will allow us to monitor that service.

jgreben commented 4 years ago
  • ensure whole team is capable of doing "the stuff" for sinopia (deployments, setting up local laptops to do what is nec)
  • hints on debugging problems across diff AWS containers

@ndushay I added some of your points to be included in the checklist at the top.

kamchan commented 4 years ago

Following elasticsearch monitoring checks have been added to prod/stag/dev.

sinopia-es-production-healthy-nodes-alert sinopia-es-production-jvm-mem-alert sinopia-es-production-cpu-high-alert

michelleif commented 4 years ago

@jgreben is this epic complete?

michelleif commented 4 years ago

@jermnelson can we close this?

jermnelson commented 4 years ago

@michelleif - closing this issue.