influxdata / telegraf

Agent for collecting, processing, aggregating, and writing metrics, logs, and other arbitrary data.
https://influxdata.com/telegraf
MIT License

Add support for linux perf_events subsystem #3654

Closed juju4 closed 1 year ago

juju4 commented 6 years ago

System info:

Linux

Feature Request

Capsule8 has published two blog posts on detecting Meltdown/Spectre PoCs through

It should be similar for perfmon on Windows; I'm unsure about macOS.

Telegraf has the win_perf_counters (https://github.com/influxdata/telegraf/tree/release-1.5/plugins/inputs/win_perf_counters) and sysstat (https://github.com/influxdata/telegraf/tree/release-1.5/plugins/inputs/sysstat) inputs, but no Linux perf input as far as I can tell.

Proposal:

Are there any plans to support Linux perf?

In the meantime, sysstat can be used for page fault monitoring, but I don't think it's an option for cache side-channel activity. Sadly, I'm not sure whether there are options to do the same on macOS.

Use case: [Why is this important (helps with prioritizing requests)]

Better troubleshooting of hardware activity and better security monitoring, as hardware exploitation will most probably continue to develop.

Thanks

danielnelson commented 6 years ago

Due to the never-ending stream of security bugs, I don't know if we will add any plugins that directly target specific issues, but we could add new metrics that allow monitoring to be done.

There are no concrete plans at this time for adding support for perf_events, but it would be a nice feature to have.

juju4 commented 6 years ago

Thanks @danielnelson

OK, so perf_events is not available. Any comments on page fault availability/usability?

danielnelson commented 6 years ago

The kernel_vmstat plugin has pgfault and pgmajfault fields, which are taken from /proc/vmstat. Does that look like the data you are after?
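
For anyone who wants to check those two counters by hand before enabling the plugin, here is a minimal sketch in Python. It is not how the kernel_vmstat plugin itself is implemented; the function names `parse_vmstat` and `read_page_faults` are made up for illustration. It assumes a Linux host where /proc/vmstat has one `name value` pair per line.

```python
from pathlib import Path


def parse_vmstat(text, fields=("pgfault", "pgmajfault")):
    """Extract the requested counters from /proc/vmstat-style text.

    Hypothetical helper: each line is expected to be "<name> <integer>".
    Lines that do not match that shape are skipped.
    """
    counters = {}
    for line in text.splitlines():
        parts = line.split()
        if len(parts) == 2 and parts[0] in fields:
            counters[parts[0]] = int(parts[1])
    return counters


def read_page_faults(path="/proc/vmstat"):
    """Read the page fault counters from the live file (Linux only)."""
    return parse_vmstat(Path(path).read_text())


# Usage on a Linux host:
#   print(read_page_faults())
```

The values are cumulative since boot, so a fault rate comes from taking the difference between two samples and dividing by the interval between them.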

juju4 commented 6 years ago

It seems so. Thanks a lot @danielnelson!

powersj commented 1 year ago

Closing as user got the various metrics they were looking for.