cfl0ws / cosmos-validator-mission-control

15 stars 5 forks source link

Identify tool combinations to implement the spec #7

Open cfl0ws opened 4 years ago

cfl0ws commented 4 years ago

I'm currently leaning toward a combination of Prometheus and Icinga to implement the spec. Prometheus is well-proven and familiar to the Cosmos community.

I've been looking at and experimenting with Icinga for a while. It seems to be an effective open-source monitoring and alerting tool.

Other tools mentioned in the past by community members include -

Graylog - Log monitoring

Zabbix - Network, server, cloud and application monitoring

ELK - Network operations monitoring

Next steps -

cfl0ws commented 4 years ago

I've shortlisted the tools to -

Next step will be to map each to the requirements docs to identify the best fit or fits, if a combination of tools may be required.

cfl0ws commented 4 years ago

After doing some additional research, it seems like a combination of Prometheus and Icinga is the way to go. If this combination falls short, I'll look at Zabbix again.

cfl0ws commented 4 years ago

It turns out that Icinga was unnecessary, as we were able to customize Grafana/Prometheus to send the alerts we needed to send, using a custom-built alerting module, which will be open-sourced with the rest of the tool.