Describe the suggestion
Add embedded performance profiling capability.
Justification
Optional capability to enable reporting of execution time required across major functions within OmniPerf . Useful for ongoing development optimization and performance regression detection.
Implementation
Include a timer class that can be used to demarcate start/stop for regions of interest and aggregate wall-clock execution.
Additional Notes
Integrate with logger option mentioned above or have separate command-line argument to enable.
Describe the suggestion Add embedded performance profiling capability.
Justification Optional capability to enable reporting of execution time required across major functions within OmniPerf . Useful for ongoing development optimization and performance regression detection.
Implementation Include a timer class that can be used to demarcate start/stop for regions of interest and aggregate wall-clock execution.
Additional Notes Integrate with logger option mentioned above or have separate command-line argument to enable.
Originally posted by @koomie in https://github.com/AMDResearch/omniperf/discussions/153#discussioncomment-6630057