Open 842974287 opened 5 months ago
Suggestion Description
Hi, for NVIDIA GPUs we can use nvidia-smi to check the ECC mode and to turn ECC on or off. I couldn't find any documentation on ECC mode for AMD GPUs. Does rocm-smi also support checking it?
Operating System
No response
GPU
MI300x
ROCm Component
No response
Both rocm-smi and amd-smi tools can show the ECC status:
rocm-smi: rocm-smi --showrasinfo
amd-smi: amd-smi monitor --ecc
Turning ECC on or off is not supported at the moment.
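For anyone who wants to script the check, here is a minimal sketch that picks whichever of the two tools is installed and runs the command quoted above. The helper names (`pick_ecc_command`, `show_ecc_status`) and the choice to try amd-smi first are assumptions made for illustration, not something either tool prescribes. The output is returned as raw text because its format is not described in this thread.

```python
import shutil
import subprocess

# Commands quoted in the reply above. Trying amd-smi first is only an
# assumption of this sketch, not a recommendation from the thread.
ECC_COMMANDS = {
    "amd-smi": ["amd-smi", "monitor", "--ecc"],
    "rocm-smi": ["rocm-smi", "--showrasinfo"],
}


def pick_ecc_command(which=shutil.which):
    """Return the ECC status command for the first tool found on PATH, else None."""
    for tool, command in ECC_COMMANDS.items():
        if which(tool) is not None:
            return command
    return None


def show_ecc_status():
    """Run the selected command and return its raw output, or None if no tool is installed."""
    command = pick_ecc_command()
    if command is None:
        return None
    # check=False: a non-zero exit is left for the caller to interpret.
    result = subprocess.run(command, capture_output=True, text=True, check=False)
    return result.stdout
```

Calling `show_ecc_status()` on a machine with ROCm installed should return whatever the selected tool prints. The `which` parameter exists only so the selection logic can be exercised without a GPU.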