facebookincubator / dynolog

Dynolog is a telemetry daemon for performance monitoring and tracing. It exports metrics from different components in the system like the linux kernel, CPU, disks, Intel PT, GPUs etc. Dynolog also integrates with pytorch and can trigger traces for distributed training applications.
MIT License
187 stars 34 forks source link

Add nvidia_nvlink_c2c1 PMU #263

Closed bigzachattack closed 1 month ago

bigzachattack commented 1 month ago

Summary: NVIDIA has to Chip-to-chip PMUs, each one is responsible for different measurements:

{F1645197609}

Differential Revision: D57635524

facebook-github-bot commented 1 month ago

This pull request was exported from Phabricator. Differential Revision: D57635524

facebook-github-bot commented 1 month ago

This pull request was exported from Phabricator. Differential Revision: D57635524

facebook-github-bot commented 1 month ago

This pull request was exported from Phabricator. Differential Revision: D57635524

facebook-github-bot commented 1 month ago

This pull request was exported from Phabricator. Differential Revision: D57635524

facebook-github-bot commented 1 month ago

This pull request has been merged in facebookincubator/dynolog@62265f95787e4137ee8309f9b8a2f211a65a5fe4.