Open automatingerror opened 2 years ago
Thinking about it, the solution doesn't need to be very complicated. We could just have a little script that runs as a cron job and emails the IT committee if disk usage rises above 90%.
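A minimal sketch of what such a script might look like, assuming Python 3 is available on the host and a mail server accepts mail on localhost. The recipient address, sender address, and SMTP host are placeholders invented for illustration, not values from this issue; only the 90% threshold comes from the comment above.

```python
import shutil
import smtplib
from email.message import EmailMessage

THRESHOLD = 0.90  # alert when usage rises above 90%, per the comment above

# Hypothetical placeholders: replace with the real addresses and mail host.
ALERT_TO = "it-committee@example.org"
ALERT_FROM = "root@localhost"
SMTP_HOST = "localhost"


def usage_fraction(path="/"):
    """Return the used fraction (0.0 to 1.0) of the filesystem holding path."""
    usage = shutil.disk_usage(path)
    return usage.used / usage.total


def over_threshold(fraction, threshold=THRESHOLD):
    """True only when usage is strictly above the threshold."""
    return fraction > threshold


def build_alert(path, fraction):
    """Build the warning email without sending it."""
    msg = EmailMessage()
    msg["Subject"] = f"Disk usage on {path} is at {fraction:.0%}"
    msg["From"] = ALERT_FROM
    msg["To"] = ALERT_TO
    msg.set_content(
        f"The filesystem holding {path} is {fraction:.1%} full, "
        f"which is above the {THRESHOLD:.0%} alert threshold."
    )
    return msg


def main(path="/"):
    """Check one filesystem and send an email if it is too full."""
    fraction = usage_fraction(path)
    if over_threshold(fraction):
        with smtplib.SMTP(SMTP_HOST) as smtp:
            smtp.send_message(build_alert(path, fraction))


# A cron-run wrapper would call main(); it is left uncalled here so that
# importing this module has no side effects.
```

Note that `shutil.disk_usage` computes usage from total and used bytes, so the figure can differ slightly from the percentage `df` reports on filesystems with reserved blocks. Run hourly from cron, this would send one email per hour for as long as the disk stays above the threshold, which may or may not be the desired behaviour.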
Actions taken:
/metrics on clubhouse.tcmaker.org and direct that to node_exporter instead of django
To do
In order to be aware of upcoming maintenance needs, such as log overflows or other issues that would fill up disks, instance monitoring needs to be deployed.