ROCm / rocprofiler-compute

Advanced Profiling and Analytics for AMD Hardware
https://rocm.docs.amd.com/projects/omniperf/en/latest/
MIT License
135 stars 49 forks source link
gpu-kernels hardware-counters hpc linux performance-analysis profiling

Ubuntu 22.04 RHEL 8 Instinct Docs DOI

Omniperf

General

Omniperf is a system performance profiling tool for machine learning/HPC workloads running on AMD MI GPUs. The tool presently targets usage on MI100, MI200, and MI300 accelerators.

Development

Omniperf follows a main-dev branching model. As a result, our latest stable release is shipped from the amd-mainline branch, while new features are developed in our amd-staging branch.

Users may checkout amd-staging to preview upcoming features.

How to Cite

This software can be cited using a Zenodo DOI reference. A BibTex style reference is provided below for convenience:

@software{xiaomin_lu_2022_7314631
  author       = {Xiaomin Lu and
                  Cole Ramos and
                  Fei Zheng and
                  Karl W. Schulz and
                  Jose Santos and
                  Keith Lowery and
                  Nicholas Curtis and
                  Cristian Di Pietrantonio},
  title        = {AMDResearch/omniperf: v2.1.0 (27 Sept 2024)},
  month        = sept,
  year         = 2024,
  publisher    = {Zenodo},
  version      = {v2.1.0},
  doi          = {10.5281/zenodo.7314631},
  url          = {https://doi.org/10.5281/zenodo.7314631}
}