facebookincubator / dynolog

Dynolog is a telemetry daemon for performance monitoring and tracing. It exports metrics from different components in the system, such as the Linux kernel, CPU, disks, Intel PT, and GPUs. Dynolog also integrates with PyTorch and can trigger traces for distributed training applications.
MIT License
227 stars, 34 forks

refactor to use MetricFrame to store task netns counters #139

Closed Alston-Tang closed 1 year ago

Alston-Tang commented 1 year ago

Summary: Use MetricFrame to store the last 60 seconds of history for Tupperware task netns counters, and change the collection granularity of netns counters to one sample per second.

This way, we can still report netns metrics over the last one-minute reporting window.
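
To illustrate the idea, here is a minimal sketch of a fixed-size history of per-second counter samples from which a reporting-window delta can be derived. It is not the MetricFrame API and not the code in this diff: `CounterHistory`, `addSample`, and `windowDelta` are hypothetical names, and the sketch assumes each netns counter is a cumulative, monotonically increasing value.

```cpp
#include <array>
#include <cstddef>
#include <cstdint>
#include <optional>

// Hypothetical ring buffer of cumulative counter samples, one per second.
// Illustrative only; this is not dynolog's MetricFrame.
template <std::size_t Capacity>
class CounterHistory {
  static_assert(Capacity >= 2, "need at least two samples to form a delta");

 public:
  // Record the latest cumulative counter value. Once the buffer is full,
  // the oldest sample is overwritten.
  void addSample(std::uint64_t value) {
    samples_[next_] = value;
    next_ = (next_ + 1) % Capacity;
    if (size_ < Capacity) {
      ++size_;
    }
  }

  // Number of samples currently retained.
  std::size_t size() const {
    return size_;
  }

  // Difference between the newest and oldest retained samples.
  // Returns std::nullopt until at least two samples have been recorded.
  std::optional<std::uint64_t> windowDelta() const {
    if (size_ < 2) {
      return std::nullopt;
    }
    const std::size_t newest = (next_ + Capacity - 1) % Capacity;
    // Before the buffer wraps, the oldest sample sits at index 0;
    // afterwards it sits at the slot that will be overwritten next.
    const std::size_t oldest = (size_ == Capacity) ? next_ : 0;
    return samples_[newest] - samples_[oldest];
  }

 private:
  std::array<std::uint64_t, Capacity> samples_{};
  std::size_t next_ = 0;
  std::size_t size_ = 0;
};
```

With one sample per second, a `CounterHistory<60>` spans 59 seconds between its oldest and newest samples, so a sketch like this would need 61 slots to cover a full 60-second window. How the actual change handles that boundary is not stated here.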

Differential Revision: D45803407

facebook-github-bot commented 1 year ago

This pull request was exported from Phabricator. Differential Revision: D45803407