ROCm / rocprofiler-compute

Advanced Profiling and Analytics for AMD Hardware
https://rocm.docs.amd.com/projects/omniperf/en/latest/
MIT License
135 stars 49 forks source link

Option to show per-XCD memory metrics #369

Open ryanswann-amd opened 5 months ago

ryanswann-amd commented 5 months ago

Is your feature request related to a problem? Please describe. I am trying to optimize an application and would like better visibility into how to get good reuse.

Describe the solution you'd like I would like the option for the memory map to show memory metrics hierarchically (for example, per-XCD) as the memory is disaggregated across the hierarchy (as mentioned here : https://www.amd.com/content/dam/amd/en/documents/instinct-tech-docs/white-papers/amd-cdna-3-white-paper.pdf)

Describe alternatives you've considered I've considered using the profiler and microbenchmarks to get XCD-level information and hacking together my own data analysis infrastructure.

Thanks!