amd / amd_smi_exporter

The AMD SMI Exporter exports AMD EPYC CPU & Datacenter GPU metrics to the Prometheus server.
Other
37 stars 8 forks source link

amd_smi_exporter results disagree with rocm-smi #12

Open yx-lamini opened 2 months ago

yx-lamini commented 2 months ago

amd_smi_exporter cannot detect any any GPUs [1], rocm-smi shows 4 gpus [2], and amd-smi got messed up, showing internal errors on the screen (just a symptom of potential internal corruption of the system)

[1] amd_smi_exporter shows no GPUs image

[2] rocm-smi shows 4 GPUs image

[3] amd-smi was messed up during the installation process image

muralimk-amd commented 1 month ago

Hi lamini, We haven't observed any issues. Could you please checkout the latest https://github.com/ROCm/amdsmi and install and then try with amd_smi_exporter. will provide steps if needed.