Open drewpost opened 1 year ago
Pinging @elastic/uptime (Team:uptime)
After going through the options here with @kobelb we have two options:
Practically speaking option 1 is far more feasible in less time. Option 2 would require rethinking good chunks of our fleet integration, the synthetics service and kibana alerting.
One barrier to 1. is that currently serverless only supports a 60s minimum schedule, but they could make an exception for this alert.
Pinging @elastic/obs-ux-management-team (Team:obs-ux-management)
For the synthetics use case, customers require notification of issues with their monitors as close to instantaneously as possible. The current alerting framework available to the app only allows alerting via a look-back window. This delays sending an alert and impacts our end users' experience.
We want to be able to trigger an alert being sent with an event vs a time-based look back window. This would enable the following user flow: Synthetic Monitor executes a schedule or ad-hoc test run -> the test run fails -> the monitor moves into an error state and a new error event is opened -> an alert is triggered. There should be the absolute minimum amount of time required between each step of that flow.