ROCm / rocprofiler-compute

Advanced Profiling and Analytics for AMD Hardware
https://rocm.docs.amd.com/projects/omniperf/en/latest/
MIT License

Better normalization modes over multiple kernels. #175

Open coleramos425 opened 1 year ago


Describe the suggestion

Better normalization modes over multiple kernels.

Justification

In conversations with users, I have found significant confusion over the values that are presented when multiple kernels are selected for analysis. In particular, folks ask questions like "Why did my bandwidth go down when I executed <10x more kernels>?" My feeling is that the way we present normalization over multiple kernels is flawed.
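
To make the confusion concrete, here is a minimal sketch of how a reported bandwidth can drop once more kernels are included in the selection. It is an illustration only: the `KernelSample` type, the two aggregation functions, and all numbers are hypothetical, and nothing here is meant to describe how the tool currently computes its normalized values.

```python
from dataclasses import dataclass


@dataclass
class KernelSample:
    """Hypothetical per-kernel measurement (not a real tool data structure)."""
    name: str
    bytes_moved: float  # total bytes read + written by the kernel
    duration_s: float   # kernel execution time in seconds


def time_weighted_bandwidth(samples):
    """Aggregate bandwidth: total bytes divided by total kernel time."""
    total_bytes = sum(s.bytes_moved for s in samples)
    total_time = sum(s.duration_s for s in samples)
    return total_bytes / total_time


def mean_of_per_kernel_bandwidth(samples):
    """Unweighted average of each kernel's own bandwidth."""
    per_kernel = [s.bytes_moved / s.duration_s for s in samples]
    return sum(per_kernel) / len(per_kernel)


# One memory-bound kernel: 100 GB moved in 0.1 s.
one_kernel = [KernelSample("copy", 100e9, 0.1)]

# The same kernel plus nine compute-bound kernels that each move
# only 1 GB in 0.2 s (assumed values, chosen for illustration).
ten_kernels = one_kernel + [
    KernelSample(f"compute_{i}", 1e9, 0.2) for i in range(9)
]

if __name__ == "__main__":
    for label, samples in [("1 kernel", one_kernel), ("10 kernels", ten_kernels)]:
        weighted = time_weighted_bandwidth(samples) / 1e9
        unweighted = mean_of_per_kernel_bandwidth(samples) / 1e9
        print(f"{label}: time-weighted {weighted:.1f} GB/s, "
              f"mean of per-kernel {unweighted:.1f} GB/s")
```

With these assumed numbers, the single kernel reports 1000 GB/s, while the ten-kernel selection reports roughly 57 GB/s under time weighting and 104.5 GB/s under an unweighted mean. Neither figure is wrong, but each answers a different question, and the "copy" kernel itself is no slower in either case.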

Implementation

Options include:

Originally posted by @arghdos in https://github.com/AMDResearch/omniperf/discussions/153#discussioncomment-6576521