Coordinating Independent Monitoring Agent Architecture (CIMAA)


.. contents::

We were looking for a monitoring solution to replace our previous in-house system, which had itself replaced Nagios. Certain characteristics were very important to us:

We shopped around and failed to find an existing system that addressed our needs. Maybe we would have found something eventually, but we realized that, with less effort than it would take to find and integrate what we needed, we could build something very simple.

We'd built an in-house system before, which, while satisfying some of the requirements above, still fell short and was more complicated than we were comfortable maintaining over the long run. Experience with this system, and with Nagios before it, informed our requirements and our approach.

Architectural overview


A CIMAA system consists of one or more agents spread over each machine we control. Generally, each agent is only responsible for checking the machine it runs on.

Agents

Checks

Meta checks

We'll need to put some thought into strategies and support for avoiding thundering herds.
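One common strategy for avoiding thundering herds is to add random jitter to check scheduling, so that agents on different machines do not all hit a shared resource at the same moment. The sketch below is only an illustration of that idea, not part of CIMAA: the function names ``initial_offset`` and ``jittered_delays`` and the ``jitter_fraction`` parameter are hypothetical.

```python
import random


def initial_offset(interval, rng=random):
    """Pick a random start delay within one interval (hypothetical helper).

    Starting each agent at a different offset keeps agents that boot
    together from running their first check at the same instant.
    """
    return rng.uniform(0, interval)


def jittered_delays(interval, jitter_fraction=0.1, rng=random):
    """Yield sleep delays scattered around `interval` seconds.

    Each delay falls within +/- `jitter_fraction` of the interval, so
    agents that happen to line up drift apart again over time.
    """
    spread = interval * jitter_fraction
    while True:
        yield interval + rng.uniform(-spread, spread)
```

An agent loop could sleep for ``initial_offset(60)`` seconds once at startup and then sleep for ``next(delays)`` seconds between check runs, where ``delays = jittered_delays(60)``.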

Squelches

Status


We're still building.

Done

To do

Changes


0.6.0 (2015-05-29)

0.5.3 (2015-03-10)

0.5.2 (2015-02-24)

0.5.1 (2015-02-11)

0.5.0 (2015-02-10)

0.4.1 (2015-02-09)

0.4.0 (2015-02-09)

0.3.1 (2015-02-07)

0.3.0 (2015-01-21)

0.2.3 (2015-01-16)

0.2.2 (2015-01-16)

0.2.1 (2015-01-16)

0.2.0 (2015-01-14)

0.1.3 (2014-12-22)

Fixed a local variable reference in the DynamoDB implementation.

0.1.2 (2014-12-18)

Restored the message field on fault records returned by DynamoDB when it was omitted because of an empty string value.

0.1.1 (2014-12-17)

Fixed log level configuration for Sentry.

0.1.0 (2014-12-15)

Initial release.

.. _Slack: https://slack.com/