banzaicloud / spot-termination-exporter

Prometheus spot instance exporter to monitor AWS instance termination with Hollowtrees
Apache License 2.0
36 stars 16 forks source link

Instance scheduled maintenance #14

Open danielmotaleite opened 3 years ago

danielmotaleite commented 3 years ago

Is your feature request related to a problem? Please describe. I'm looking for a metric that reports ec2 instance scheduled maintenance on ondemand instances. Tried this one but only works now for stop instances. Adding support for ondemand should parse the http://169.254.169.254/latest/meta-data/events/maintenance/scheduled entry. I do not know if spot instances also may use this entry for underlying host problems

Describe the solution you'd like to see A new metric for aws_scheduled_maintenance_seconds

$ curl http://169.254.169.254/latest/meta-data/events/maintenance/scheduled [ { "NotBefore" : "1 Jan 2021 02:00:00 GMT", "Code" : "instance-stop", "Description" : "The instance is running on degraded hardware", "EventId" : "instance-event-07a1c1c08b77d4bf1", "State" : "active" } ]

Describe alternatives you've considered Right now we manage this via the email notifications that is far from ideal way to find and control what instances may have problems, the deadline for releasing the host and if it was already solved.

Additional context https://docs.aws.amazon.com/AWSEC2/latest/WindowsGuide/monitoring-instances-status-check_sched.html#viewing_scheduled_events