amd_smi_exporter cannot detect any GPUs [1], while rocm-smi shows 4 GPUs [2]. amd-smi got messed up during installation and now shows internal errors on the screen [3], which may just be a symptom of internal corruption of the system.
[1] amd_smi_exporter shows no GPUs
[2] rocm-smi shows 4 GPUs
[3] amd-smi was messed up during the installation process
Hi lamini, we haven't observed any issues.
Could you please check out the latest https://github.com/ROCm/amdsmi, install it, and then try again with amd_smi_exporter?
We will provide steps if needed.