Closed: victorb closed this issue 6 years ago
Remember what this means?
- Can check out the monitoring dashboards
Can view the monitoring dashboard? Or: can run the dashboard locally, in an automated way?
@lgierth my intention was "can view the monitoring dashboard".
Thinking about it now, we should probably have two tasks: one for being able to run monitoring locally and one for having it working in production. What do you think?
Yeah let that running-locally be separate: ipfs/infrastructure#52
Ok, in that case, this task would depend on us having jenkins deployed somewhere before you can start working on it. Correct?
Nah I can get started with the local jenkins
The dashboard will need tuning when there are actual jobs to monitor :) And that's also when we can start setting alert conditions.
Prometheus is currently set to scrape [fce3:5702:8051:3e65:3a36:1299:c458:1470]:8090/prometheus
and we can change that to what comes out of #8.
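For anyone re-pointing the scrape target later, here is a rough sketch of how the address above breaks down into the fields a Prometheus scrape job takes (`job_name`, `metrics_path`, `static_configs`/`targets`). The helper `scrape_job` and the job name `jenkins` are made up for illustration and are not taken from our actual config; the scheme is not stated above, so none is set here.

```python
from urllib.parse import urlsplit

# Scrape target exactly as quoted in this thread.
TARGET = "[fce3:5702:8051:3e65:3a36:1299:c458:1470]:8090/prometheus"


def scrape_job(target: str, job_name: str = "jenkins") -> dict:
    """Split 'host:port/path' into the fields of a Prometheus scrape job.

    Hypothetical helper; 'jenkins' is an assumed job name.
    """
    # A leading '//' makes urlsplit treat the text as a network location,
    # which also handles the bracketed IPv6 address correctly.
    parts = urlsplit("//" + target)
    return {
        "job_name": job_name,
        # Prometheus falls back to /metrics when no path is configured.
        "metrics_path": parts.path or "/metrics",
        "static_configs": [{"targets": [parts.netloc]}],
    }


job = scrape_job(TARGET)
print(job)
```

Once #8 settles on a new address, only the `TARGET` string would need to change; the host:port part ends up under `targets` and the `/prometheus` part under `metrics_path`.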
I'm wrapping this up with ipfs/infrastructure#235 which makes all provsn units systemd-compatible, so that we now also get host metrics (cpu, ram, io).
The dashboard should start showing numbers tomorrow when @VictorBjelkholm brings jenkins back up (I broke it). Over the rest of the sprint we'll tune the dashboard and add alerts as we see fit.
Splitting off tuning and alerting to #31.
The moved infrastructure for jenkins and monitoring is not set up yet. Reopening this in the meantime.
Yay, jenkins monitoring dashboard is back online!
Acceptance Criteria
Tasks
Dependencies
Depends on #1