Our deployment ansibles get run asynchronously via cron. We are currently not monitoring them in any way. If one is failing, it usually doesn't get noticed until one of us bothers to look. We should output something from ansible-runner tasks that indicate failure, and pick up on that via a datadog monitor. This can be something as simple as dropping a file for the failing environment somewhere on the filesystem, and a datadog monitor that triggers when said files exist?
Our deployment ansibles get run asynchronously via cron. We are currently not monitoring them in any way. If one is failing, it usually doesn't get noticed until one of us bothers to look. We should output something from ansible-runner tasks that indicate failure, and pick up on that via a datadog monitor. This can be something as simple as dropping a file for the failing environment somewhere on the filesystem, and a datadog monitor that triggers when said files exist?