ROCm / rocprofiler-compute

Advanced Profiling and Analytics for AMD Hardware
https://rocm.docs.amd.com/projects/omniperf/en/latest/
MIT License

Use the rocprofiler API interface instead #171

Open coleramos425 opened 1 year ago

coleramos425 commented 1 year ago

Describe the suggestion

Use the rocprofiler API interface instead of doing \

Justification

The idea here is that we can be far more selective about which kernels we want to profile. For instance, we could offer users a mode that attempts to avoid replaying the application at all, e.g., by cycling through different sets of counters across successive launches of the 'same' kernel. This lines up with some of the things we've been talking about internally re: kernel selection / cutting down replays.
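To make the counter-cycling idea concrete, here is a minimal sketch in plain Python. It only simulates the scheduling logic and makes no rocprofiler calls. The `CounterScheduler` class, the `plan` helper, and the counter groupings in `COUNTER_SETS` are all hypothetical names chosen for illustration. A real implementation would need to hook kernel dispatch through the profiler API and respect hardware limits on which counters can be collected together.

```python
from collections import defaultdict

# Illustrative counter groups only (assumption): real groupings depend on
# the target hardware and on how many counters fit in a single pass.
COUNTER_SETS = [
    ("SQ_WAVES", "SQ_INSTS_VALU"),
    ("TCC_HIT", "TCC_MISS"),
    ("GRBM_GUI_ACTIVE",),
]


class CounterScheduler:
    """Hand out a different counter set for each successive launch of a kernel."""

    def __init__(self, counter_sets):
        self._counter_sets = list(counter_sets)
        # Number of launches seen so far, keyed by kernel name.
        self._launch_counts = defaultdict(int)

    def next_set(self, kernel_name):
        """Return the counter set to collect for this launch (round-robin)."""
        n = self._launch_counts[kernel_name]
        self._launch_counts[kernel_name] += 1
        return self._counter_sets[n % len(self._counter_sets)]

    def is_covered(self, kernel_name):
        """True once every counter set has been collected at least once."""
        return self._launch_counts[kernel_name] >= len(self._counter_sets)


def plan(launches, counter_sets=COUNTER_SETS):
    """Map a sequence of kernel launches to the counter set each would collect."""
    scheduler = CounterScheduler(counter_sets)
    assignments = [(name, scheduler.next_set(name)) for name in launches]
    return assignments, scheduler


if __name__ == "__main__":
    assignments, scheduler = plan(["gemm", "gemm", "copy", "gemm", "gemm"])
    for name, counters in assignments:
        print(name, counters)
    print("gemm fully covered:", scheduler.is_covered("gemm"))
    print("copy fully covered:", scheduler.is_covered("copy"))
```

In this sketch a kernel launched at least as many times as there are counter sets gets full coverage in a single application run, while a kernel launched less often (like `copy` above) would still need a replay or a fallback. It also assumes that launches sharing a name behave alike, which is the caveat behind the quotes around 'same' above.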

Implementation

Originally posted by @arghdos in https://github.com/AMDResearch/omniperf/discussions/153#discussioncomment-6503094