ROCm / rocmProfileData

MIT License
14 stars 8 forks source link

Add examples to compute kernel launch delay and inter-kernel delay #36

Closed mwootton closed 1 year ago

mwootton commented 1 year ago

Add to helpful queries. Join kernelApi and op; add queue depth (number of kernels already enqueued when a new one is enqueued) One .cmd to show launch delay based on kernel (when there is no queue depth) One .cmd to show time lost in inter-kernel switching (when there is queue depth)