Closed JohannesFleischer closed 3 months ago
Hi @JohannesFleischer
Quick explanation: the value of aide_violation_count in the Prometheus time series DB is set inside our Aide BOSH Release. The release has a run_report task that generates a file containing that metric. You can see that being done here.
On all our BOSH deployments, we co-locate the node_exporter release, which is configured to read files placed in its config folder and push the file contents to Prometheus via the node gateway deployed with Prometheus. The run_report task has logic to find the errors and either increment that count or leave it at zero. Any value greater than zero triggers the alert.
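To make the idea concrete, here is a rough Python sketch of what a task like run_report could do. It is not the code from the release: the report markers ("added:", "removed:", "changed:"), the file name aide.prom, and all function names are assumptions for illustration only. It counts violations in a report and writes a file in the Prometheus text exposition format that node_exporter can pick up.

```python
import os
import tempfile

# Hypothetical markers for lines that report a violation; the real
# run_report task may detect errors in a different way.
VIOLATION_PREFIXES = ("added:", "removed:", "changed:")


def count_violations(report_text: str) -> int:
    """Count report lines that start with one of the assumed markers."""
    return sum(
        1
        for line in report_text.splitlines()
        if line.strip().startswith(VIOLATION_PREFIXES)
    )


def render_metric(violations: int) -> str:
    """Render the metric in the Prometheus text exposition format."""
    return (
        "# HELP aide_violation_count Violations found in the last AIDE report.\n"
        "# TYPE aide_violation_count gauge\n"
        f"aide_violation_count {violations}\n"
    )


def write_metric_file(directory: str, violations: int,
                      filename: str = "aide.prom") -> str:
    """Write the metric file atomically so a reader never sees a partial file."""
    target = os.path.join(directory, filename)
    fd, tmp_path = tempfile.mkstemp(dir=directory, suffix=".tmp")
    with os.fdopen(fd, "w") as handle:
        handle.write(render_metric(violations))
    os.replace(tmp_path, target)  # atomic rename within the same directory
    return target
```

The count stays at zero when the report is clean, so an alert expression on a value greater than zero only fires when at least one violation line was found.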
I'm completely new to Prometheus and just starting to use it. My current goal is to set up an alerting rule for aide. Therefore, I searched for examples and found this configuration snippet in your repo:
I really don't understand how this works and would be very happy if you could explain it to me, especially where the aide_violation_count value comes from, because I can't find any mention of Prometheus supporting aide natively and don't see where this value gets scraped. Sorry for any inconvenience.