Open johrstrom opened 1 year ago
Also be good to cross reference the XDMOD module to parse OnDemand usage logs: https://ondemand.xdmod.org/10.0/overview.html. There is some tooling at OSC to make that process easier, most of it is here: https://github.com/treydock/puppet-module-xdmod/tree/master/templates/ondemand
WRT Nagios, that's actually a lot harder to document and be useful than Prometheus as Prometheus it's pretty easy to share alerts and configs with other sites but Nagios is configured so many different ways and the way OSC used it was kind of complicated and hard to follow. The most we might have done was monitor that Apache was online , nothing really specific to OnDemand itself.
Some suggestions from Alan -
┆Issue is synchronized with this Asana task by Unito