createthis / linux_gpu_mining

Some useful scripts for GPU mining on Linux
114 stars 115 forks source link

Nvidia-settings fan speed #1

Open Canefire opened 6 years ago

Canefire commented 6 years ago

I have installed using this repository on three systems, each with 6 Zotac GTX 1070 GPUs with great success, except in controlling GPU fan speed. There is an error consistent to all builds when running ./underclock.sh. When I look at Nvidia-settings there is no option to control the fan speed and the fans are not running at 80%. I have temps of 70 to 71C on the GPUs so I believe they are auto cooling at a temp setting of 70 perhaps. I would very much like to speed them up to 90%, but any changes made in the underclock.sh file do not take effect. I have tried to find a solution, but it seems there is no clear way to do this with the Nvidia drivers. This error occurs with the fan settings at 80 as in the repository example file as well, Power settings are taking effect, I have set to 125 watts per card.

Ubuntu 16.04 Nvidia driver 384.90 Screen on run follows:


ERROR: Error assigning value 90 to attribute 'GPUTargetFanSpeed' (canefire-Z270P-D3:0[fan:0]) as specified in assignment '[fan:0]/GPUTargetFanSpeed=90' (Unknown Error).

ERROR: Error assigning value 90 to attribute 'GPUTargetFanSpeed' (canefire-Z270P-D3:0[fan:1]) as specified in assignment '[fan:1]/GPUTargetFanSpeed=90' (Unknown Error).

./underclock.sh: 4: ./underclock.sh: Sudo: not found

ERROR: Error assigning value 90 to attribute 'GPUTargetFanSpeed' (canefire-Z270P-D3:0[fan:3]) as specified in assignment '[fan:3]/GPUTargetFanSpeed=90' (Unknown Error).

./underclock.sh: 6: ./underclock.sh: Sudo: not found

ERROR: Error assigning value 90 to attribute 'GPUTargetFanSpeed' (canefire-Z270P-D3:0[fan:5]) as specified in assignment '[fan:5]/GPUTargetFanSpeed=90' (Unknown Error).

Persistence mode is already Enabled for GPU 00000000:01:00.0. Persistence mode is already Enabled for GPU 00000000:03:00.0. Persistence mode is already Enabled for GPU 00000000:04:00.0. Persistence mode is already Enabled for GPU 00000000:05:00.0. Persistence mode is already Enabled for GPU 00000000:08:00.0. Persistence mode is already Enabled for GPU 00000000:09:00.0. All done. Power limit for GPU 00000000:01:00.0 was set to 125.00 W from 125.00 W. Power limit for GPU 00000000:03:00.0 was set to 125.00 W from 125.00 W. Power limit for GPU 00000000:04:00.0 was set to 125.00 W from 125.00 W. Power limit for GPU 00000000:05:00.0 was set to 125.00 W from 125.00 W. Power limit for GPU 00000000:08:00.0 was set to 125.00 W from 125.00 W. Power limit for GPU 00000000:09:00.0 was set to 125.00 W from 125.00 W. All done.

createthis commented 6 years ago

This could happen if you fail to run:

./enable_nvidia_fan_control.sh
./restart_ubuntu_lightdm.sh

If you look at the source code for enable_nvidia_fan_control.sh you'll see that it run this command: sudo nvidia-xconfig --cool-bits=12

"cool-bits" is what enables fan control using nvidia-xconfig. Note that I didn't write nvidia-xconfig so I really can't help debug that command.

UniversusTech commented 6 years ago

I get the same issue...

Also, after each time I restart the system, for example, when I run:

./restart_ubuntu_lightdm.sh

my xorg.conf gets reset to its original values. Thus, creating a file next to it called xorg.conf.02122018

Ubuntu 16.04 Nvidia driver 390.12 GTX 1080 ti GPUs

Alekseisu commented 6 years ago

Connect the monitor to the PСI(rizer)-GPUx16 and do everything again. I solved the problem