Open saki2fifty opened 1 year ago
I have many rigs and this happens across all of them. The cards in the screenshot are RX 580s.
Can you provide more info on when the power limit resets? How long does it take for it to reset? Does anything else (clocks, performance level) reset? Is anything printed in system logs (dmesg)?
This could be solved by checking every few minutes whether the GPU's actual settings still match the previously applied ones, though this is not an ideal solution.
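For illustration, a rough sketch of such a periodic check might look like the following. It does not use LACT's API; instead it assumes the amdgpu driver exposes the power cap as `power1_cap` (in microwatts) under each card's hwmon directory in sysfs, and that the script runs as root so it can write to that file. The function names (`enforce_cap`, `find_cap_files`, `watch`) and the 5-minute interval are made up for this example.

```python
from pathlib import Path
import time


def read_cap_uw(cap_file: Path) -> int:
    """Read the current power cap in microwatts from a power1_cap file."""
    return int(cap_file.read_text().strip())


def enforce_cap(cap_file: Path, wanted_uw: int) -> bool:
    """Re-apply the wanted cap if the current value differs.

    Returns True if the cap had drifted and was rewritten, False otherwise.
    """
    if read_cap_uw(cap_file) == wanted_uw:
        return False
    cap_file.write_text(str(wanted_uw))
    return True


def find_cap_files(drm_root: Path = Path("/sys/class/drm")) -> list[Path]:
    """Locate power1_cap files for all cards (assumed amdgpu sysfs layout)."""
    return sorted(drm_root.glob("card*/device/hwmon/hwmon*/power1_cap"))


def watch(wanted_watts: int, interval_s: int = 300) -> None:
    """Check every interval_s seconds and re-apply the cap where it drifted."""
    wanted_uw = wanted_watts * 1_000_000  # sysfs value is in microwatts
    while True:
        for cap_file in find_cap_files():
            if enforce_cap(cap_file, wanted_uw):
                print(f"{cap_file}: cap had changed, reset to {wanted_watts} W")
        time.sleep(interval_s)
```

Calling `watch(100)` as root would keep re-applying a 100 W cap on every detected card. This only works around the symptom; it does not explain why the limit resets in the first place.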
Also, I have to say that I did not expect LACT to be used for mining rigs. Good to know that the API is useful though.
Currently on a work call... but yeah, LACT is 100% useful. I use it instead of rocm-smi, and it is my default OC/stats tool.
I have a check that runs every so many minutes to see if the limit has changed and forces it again.
I'd say the resets start slowly after about 15 minutes, one card at a time, and within an hour all of them have reset.
But I'll check those logs and let you know.
I have this issue too: no matter which power cap I set, the GPU starts "power throttling" even before it reaches its rated normal maximum of 145 watts. I didn't get this issue on Linux Mint, but I get it on Fedora 38/39 and Nobara 38, with and without amdgpu.ppfeaturemask=0xffffffff or amdgpu.ppfeaturemask=0xfffd7fff. It may be worth testing older kernels, where this is possibly a non-issue, to find the kernel version after which it becomes a problem.
@pinbuck this sounds like a kernel-side issue, you should report it on https://gitlab.freedesktop.org/drm/amd.
When I set all devices to a max power cap of any value (100 in this screenshot), over time the cap is ignored and the power adjusts automatically.
I loop through each device, but per device the commands are:
Exact PowerShell: