facebookincubator / dynolog

Dynolog is a telemetry daemon for performance monitoring and tracing. It exports metrics from different components in the system like the linux kernel, CPU, disks, Intel PT, GPUs etc. Dynolog also integrates with pytorch and can trigger traces for distributed training applications.
MIT License
260 stars 38 forks source link

add pause resume for dcgm #100

Closed haowangludx closed 1 year ago

haowangludx commented 1 year ago

Summary: DCGM conflicts with other NV tools and libraries, adding pause and resume functionality for oss dynolog

Differential Revision: D42753238

Privacy Context Container: L1137347

facebook-github-bot commented 1 year ago

This pull request was exported from Phabricator. Differential Revision: D42753238

facebook-github-bot commented 1 year ago

This pull request was exported from Phabricator. Differential Revision: D42753238

facebook-github-bot commented 1 year ago

This pull request was exported from Phabricator. Differential Revision: D42753238

facebook-github-bot commented 1 year ago

This pull request was exported from Phabricator. Differential Revision: D42753238

facebook-github-bot commented 1 year ago

This pull request has been merged in facebookincubator/dynolog@740b9c8e5ae52236268d9edb194c34987136ca95.