ROCm / rocprofiler-compute

Advanced Profiling and Analytics for AMD Hardware
https://rocm.docs.amd.com/projects/omniperf/en/latest/
MIT License

Roofline based on dispatch ID #435

Open etiennemlb opened 1 month ago

etiennemlb commented 1 month ago

I have profiled an application like so:

$ omniperf profile -n <case_name> --device=0 -- ./test_binary

Now I want to draw rooflines for several dispatches:

$ # Running this does not help: it generates rooflines only for the
$ # first dispatched kernel, even though I specified a non-zero dispatch index.
$ omniperf profile -n <case_name> --roof-only --kernel-names --mem-level HBM --dispatch 546 -- ../libsplb2.mi250/libsplb2/build/tests/test_concurrency_ncal_ncal.hip

coleramos425 commented 1 month ago

Can you please share which ROCm version you're using? Could you also attach the log.txt file generated by this run, for debugging purposes?

etiennemlb commented 1 month ago

I'm using ROCm 5.5.1, 5.7.1, and 6.1.2, with Omniperf 2.1.0.