desbma / hddfancontrol

Regulate fan speed according to hard drive temperature
GNU General Public License v3.0

SAS drives break functionality on master branch #50

Closed exodus454 closed 7 months ago

exodus454 commented 7 months ago

I switched from the SAS branch to the master and it starts crashing again:

hddfancontrol -d /dev/sdb /dev/sdc /dev/sdd /dev/sde /dev/sdf /dev/sdg /dev/sdh /dev/sdj /dev/sdi -p /sys/class/hwmon/hwmon12/pwm2 /sys/class/hwmon/hwmon12/pwm5 --pwm-start-value 1 1 --pwm-stop-value 0 0 --min-fan-speed-prct 1 -i 3600 --spin-down-time 12000 --min-temp=30 --max-temp=60 -v debug
2024-02-06 10:45:42,230 INFO [Main] Process real time scheduler set to 2, priority 49
2024-02-06 10:45:42,232 DEBUG [Fan #1] Fan is in enabled state 0
2024-02-06 10:45:42,232 DEBUG [Fan #1] PWM value is currently 255
2024-02-06 10:45:42,232 DEBUG [Fan #2] Fan is in enabled state 1
2024-02-06 10:45:42,232 DEBUG [Fan #2] PWM value is currently 148
2024-02-06 10:45:42,240 INFO [sdb Samsung SSD 850 PRO 256GB] Will probe temperature with method DRIVETEMP
2024-02-06 10:45:42,243 INFO [sdc HGST HUS726060ALA640] Will probe temperature with method DRIVETEMP
2024-02-06 10:45:42,245 INFO [sdd WDC WD2500HHTZ-60N21V0] Will probe temperature with method DRIVETEMP
2024-02-06 10:45:42,248 INFO [sde Samsung SSD 850 EVO 500GB] Will probe temperature with method DRIVETEMP
2024-02-06 10:45:42,250 INFO [sdf WDC WD2500HHTZ-60N21V0] Will probe temperature with method DRIVETEMP
2024-02-06 10:45:42,252 INFO [sdg HGST HUS726060ALA640] Will probe temperature with method DRIVETEMP
2024-02-06 10:45:42,255 INFO [sdh HGST HUS726060ALE610] Will probe temperature with method DRIVETEMP
2024-02-06 10:45:42,258 ERROR [Main] AssertionError:
2024-02-06 10:45:42,258 INFO [Fan #1] Setting fan speed to 100%
2024-02-06 10:45:42,258 WARNING [Fan #1] /sys/class/hwmon/hwmon12/pwm2_enable was 0, setting it to 1
2024-02-06 10:45:42,258 DEBUG [Fan #1] Setting PWM value to 255
2024-02-06 10:45:42,258 INFO [Fan #2] Setting fan speed to 100%
2024-02-06 10:45:42,258 DEBUG [Fan #2] Setting PWM value to 255

Back to the SAS branch:

Finished processing dependencies for hddfancontrol==1.5.0
root@fedora ~/hddfancontrol (sas)# hddfancontrol -d /dev/sdb /dev/sdc /dev/sdd /dev/sde /dev/sdf /dev/sdg /dev/sdh /dev/sdj /dev/sdi -p /sys/class/hwmon/hwmon12/pwm2 /sys/class/hwmon/hwmon12/pwm5 --pwm-start-value 1 1 --pwm-stop-value 0 0 --min-fan-speed-prct 1 -i 3600 --spin-down-time 12000 --min-temp=30 --max-temp=60 -v debug
2024-02-06 10:51:38,447 INFO [Main] Process real time scheduler set to 2, priority 49
2024-02-06 10:51:38,455 INFO [sdb Samsung SSD 850 PRO 256GB] Will probe temperature with method DRIVETEMP
2024-02-06 10:51:38,458 INFO [sdc HGST HUS726060ALA640] Will probe temperature with method DRIVETEMP
2024-02-06 10:51:38,460 INFO [sdd WDC WD2500HHTZ-60N21V0] Will probe temperature with method DRIVETEMP
2024-02-06 10:51:38,464 INFO [sde Samsung SSD 850 EVO 500GB] Will probe temperature with method DRIVETEMP
2024-02-06 10:51:38,488 INFO [sdf WDC WD2500HHTZ-60N21V0] Will probe temperature with method DRIVETEMP
2024-02-06 10:51:38,503 INFO [sdg HGST HUS726060ALA640] Will probe temperature with method DRIVETEMP
2024-02-06 10:51:38,507 INFO [sdh HGST HUS726060ALE610] Will probe temperature with method DRIVETEMP
2024-02-06 10:51:38,823 INFO [sdj HUH728080AL4200] Drive does not support native drivetemp temp query
2024-02-06 10:51:38,829 WARNING [sdj HUH728080AL4200] Drive does not support HGST temp query
2024-02-06 10:51:38,829 INFO [sdj HUH728080AL4200] Will probe temperature with method HDDTEMP_INVOCATION
2024-02-06 10:51:39,570 INFO [sdi ST6000NM0034] Drive does not support native drivetemp temp query
2024-02-06 10:51:39,571 WARNING [sdi ST6000NM0034] Drive does not support HGST temp query
2024-02-06 10:51:39,571 INFO [sdi ST6000NM0034] Will probe temperature with method HDDTEMP_INVOCATION
2024-02-06 10:51:39,574 DEBUG [sdd WDC WD2500HHTZ-60N21V0] Drive state: ACTIVE_IDLE
2024-02-06 10:51:39,575 DEBUG [sdf WDC WD2500HHTZ-60N21V0] Drive state: ACTIVE_IDLE
2024-02-06 10:51:39,575 DEBUG [DriveSpinDownThread-sdd WDC WD2500HHTZ-60N21V0] Sleeping for 60 seconds
2024-02-06 10:51:39,575 DEBUG [sdb Samsung SSD 850 PRO 256GB] Drive state: ACTIVE_IDLE
2024-02-06 10:51:39,576 DEBUG [DriveSpinDownThread-sdf WDC WD2500HHTZ-60N21V0] Sleeping for 60 seconds
2024-02-06 10:51:39,576 DEBUG [sdc HGST HUS726060ALA640] Drive state: ACTIVE_IDLE
2024-02-06 10:51:39,576 DEBUG [DriveSpinDownThread-sdc HGST HUS726060ALA640] Sleeping for 60 seconds
2024-02-06 10:51:39,576 DEBUG [sdg HGST HUS726060ALA640] Drive state: ACTIVE_IDLE
2024-02-06 10:51:39,576 DEBUG [DriveSpinDownThread-sdg HGST HUS726060ALA640] Sleeping for 60 seconds
2024-02-06 10:51:39,577 DEBUG [sde Samsung SSD 850 EVO 500GB] Drive state: ACTIVE_IDLE
2024-02-06 10:51:39,577 DEBUG [DriveSpinDownThread-sde Samsung SSD 850 EVO 500GB] Sleeping for 60 seconds
2024-02-06 10:51:39,577 DEBUG [sdb Samsung SSD 850 PRO 256GB] Drive state: ACTIVE_IDLE
2024-02-06 10:51:39,578 DEBUG [sdi ST6000NM0034] Drive state: UNKNOWN
2024-02-06 10:51:39,578 DEBUG [DriveSpinDownThread-sdb Samsung SSD 850 PRO 256GB] Sleeping for 60 seconds
2024-02-06 10:51:39,578 DEBUG [DriveSpinDownThread-sdi ST6000NM0034] Sleeping for 60 seconds
2024-02-06 10:51:39,578 DEBUG [sdh HGST HUS726060ALE610] Drive state: ACTIVE_IDLE
2024-02-06 10:51:39,578 DEBUG [DriveSpinDownThread-sdh HGST HUS726060ALE610] Sleeping for 60 seconds
2024-02-06 10:51:39,578 DEBUG [sdb Samsung SSD 850 PRO 256GB] Drive temperature: 21 °C
2024-02-06 10:51:39,579 DEBUG [sdj HUH728080AL4200] Drive state: UNKNOWN
2024-02-06 10:51:39,579 DEBUG [DriveSpinDownThread-sdj HUH728080AL4200] Sleeping for 60 seconds
2024-02-06 10:51:39,580 DEBUG [sdc HGST HUS726060ALA640] Drive state: ACTIVE_IDLE
2024-02-06 10:51:39,581 DEBUG [sdc HGST HUS726060ALA640] Drive temperature: 31 °C
2024-02-06 10:51:39,582 DEBUG [sdd WDC WD2500HHTZ-60N21V0] Drive state: ACTIVE_IDLE
2024-02-06 10:51:39,582 DEBUG [sdd WDC WD2500HHTZ-60N21V0] Drive temperature: 23 °C
2024-02-06 10:51:39,584 DEBUG [sde Samsung SSD 850 EVO 500GB] Drive state: ACTIVE_IDLE
2024-02-06 10:51:39,584 DEBUG [sde Samsung SSD 850 EVO 500GB] Drive temperature: 23 °C
2024-02-06 10:51:39,585 DEBUG [sdf WDC WD2500HHTZ-60N21V0] Drive state: ACTIVE_IDLE
2024-02-06 10:51:39,585 DEBUG [sdf WDC WD2500HHTZ-60N21V0] Drive temperature: 23 °C
2024-02-06 10:51:39,587 DEBUG [sdg HGST HUS726060ALA640] Drive state: ACTIVE_IDLE
2024-02-06 10:51:39,588 DEBUG [sdg HGST HUS726060ALA640] Drive temperature: 23 °C
2024-02-06 10:51:39,591 DEBUG [sdh HGST HUS726060ALE610] Drive state: ACTIVE_IDLE
2024-02-06 10:51:39,591 DEBUG [sdh HGST HUS726060ALE610] Drive temperature: 32 °C
2024-02-06 10:51:39,593 DEBUG [sdj HUH728080AL4200] Drive state: UNKNOWN
2024-02-06 10:51:39,661 DEBUG [sdj HUH728080AL4200] Drive temperature: 27 °C
2024-02-06 10:51:39,663 DEBUG [sdi ST6000NM0034] Drive state: UNKNOWN
2024-02-06 10:51:39,671 DEBUG [sdi ST6000NM0034] Drive temperature: 32 °C
2024-02-06 10:51:39,671 INFO [Main] Maximum device temperature: 32 °C
2024-02-06 10:51:39,671 INFO [Fan #1] Setting fan speed to 6%
2024-02-06 10:51:39,673 WARNING [Fan #1] /sys/class/hwmon/hwmon12/pwm2_enable was 0, setting it to 1
2024-02-06 10:51:39,673 DEBUG [Fan #1] Setting PWM value to 15
2024-02-06 10:51:39,673 INFO [Fan #2] Setting fan speed to 6%
2024-02-06 10:51:39,673 DEBUG [Fan #2] Setting PWM value to 15
2024-02-06 10:51:39,674 DEBUG [Main] Sleeping for 20 seconds
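
For readers checking the numbers in the working run: with --min-temp=30 and --max-temp=60, a maximum device temperature of 32 °C yielding a 6% fan speed and a PWM value of 15 is consistent with a simple linear mapping that truncates to whole numbers. The sketch below is only an illustration of that arithmetic, not hddfancontrol's actual code; the function names `fan_speed_percent` and `pwm_value` are hypothetical, and the 0-255 PWM range is assumed from the values shown in the log.

```python
def fan_speed_percent(hottest_c, min_temp_c, max_temp_c, min_fan_speed_prct):
    """Linearly map the hottest drive temperature to a fan speed in percent.

    Hypothetical helper: assumes a straight line between min_temp_c (floor
    speed) and max_temp_c (100 %), truncated to an integer percentage.
    """
    if hottest_c <= min_temp_c:
        return min_fan_speed_prct
    if hottest_c >= max_temp_c:
        return 100
    pct = 100 * (hottest_c - min_temp_c) // (max_temp_c - min_temp_c)
    return max(pct, min_fan_speed_prct)


def pwm_value(percent, pwm_max=255):
    """Scale a percentage to a PWM value, assuming a 0..pwm_max range."""
    return pwm_max * percent // 100


# Values taken from the command line and log above.
speed = fan_speed_percent(32, min_temp_c=30, max_temp_c=60, min_fan_speed_prct=1)
print(speed, pwm_value(speed))
```

With the figures from the log this gives 6 and 15, matching the "Setting fan speed to 6%" and "Setting PWM value to 15" lines.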

More info:

uname -a Linux fedora 6.1.75-200.fc38.x86_64 #1 SMP PREEMPT_DYNAMIC Fri Jan 26 10:57:39 UTC 2024 x86_64 GNU/Linux

lsscsi -i
[0:0:0:0] disk ATA HGST HUS726060AL T7J0 /dev/sda -
[2:0:0:0] disk ATA Samsung SSD 850 2B6Q /dev/sdb -
[4:0:0:0] disk ATA HGST HUS726060AL T1EN /dev/sdc -
[5:0:0:0] disk ATA WDC WD2500HHTZ-6 6A01 /dev/sdd -
[6:0:0:0] disk ATA Samsung SSD 850 1B6Q /dev/sde -
[7:0:0:0] disk ATA WDC WD2500HHTZ-6 6A01 /dev/sdf -
[10:0:0:0] disk ATA HGST HUS726060AL T1EN /dev/sdg -
[10:0:1:0] disk ATA HGST HUS726060AL TD05 /dev/sdh -
[10:0:2:0] disk SEAGATE ST6000NM0034 E0G5 /dev/sdi 35000c5008dc78123
[10:0:3:0] disk HGST HUH728080AL4200 A7J0 /dev/sdj 35000cca2540e4ce4
[10:0:4:0] disk ATA HGST HUS726060AL T1EN /dev/sdk -
[10:0:5:0] disk ATA HGST HUS726060AL TD05 /dev/sdl -
[10:0:6:0] disk SEAGATE ST6000NM0095 E004 /dev/sdm 35000c500a61f52d7
[10:0:7:0] disk SEAGATE ST6000NM0095 E004 /dev/sdn 35000c500a61f58f3
[N:0:0:1] disk TEAM TM8FPD512G__1 /dev/nvme0n1

The drives are a mix of brands and interfaces (SAS, SATA, and NVMe). sdi, sdj, sdm, and sdn are SAS. The daemon crashes immediately whenever a SAS drive is included, regardless of the order of the drives. I also tried --smartctl and --hddtemp-daemon, with the same result.
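
As a rough way to see which devices in the lsscsi listing are the SAS ones, the vendor column can be used: in the output above the SATA disks all report the vendor "ATA", while the four SAS disks report their real vendor and a WWN. The sketch below is a heuristic built on that observation only; the function name `classify` and the labels it returns are hypothetical, and a non-ATA vendor does not strictly prove a drive is SAS.

```python
def classify(lsscsi_line):
    """Guess the interface of one `lsscsi -i` line (heuristic sketch).

    Assumes the layout seen in this issue: [H:C:T:L] type vendor model...
    followed by a /dev/ path. Not a general-purpose lsscsi parser.
    """
    tokens = lsscsi_line.split()
    vendor = tokens[2]
    device = next(t for t in tokens if t.startswith("/dev/"))
    if device.startswith("/dev/nvme"):
        kind = "NVMe"
    elif vendor == "ATA":
        kind = "SATA"
    else:
        kind = "SCSI/SAS"  # assumption: non-ATA vendor on a SCSI host
    return device, kind


# A few lines copied from the listing above.
sample = [
    "[10:0:1:0] disk ATA HGST HUS726060AL TD05 /dev/sdh -",
    "[10:0:2:0] disk SEAGATE ST6000NM0034 E0G5 /dev/sdi 35000c5008dc78123",
    "[10:0:3:0] disk HGST HUH728080AL4200 A7J0 /dev/sdj 35000cca2540e4ce4",
    "[N:0:0:1] disk TEAM TM8FPD512G__1 /dev/nvme0n1",
]
for line in sample:
    print(*classify(line))
```

Applied to the full listing, this heuristic flags exactly sdi, sdj, sdm, and sdn, which matches the drives identified as SAS above.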

hddtemp
/dev/sda: HGST HUS726060ALE610: drive is sleeping
/dev/sdb: Samsung SSD 850 PRO 256GB: drive is sleeping
/dev/sdc: HGST HUS726060ALA640: 30°C
/dev/sdd: WDC WD2500HHTZ-60N21V0: 28°C
/dev/sde: Samsung SSD 850 EVO 500GB: 23°C
/dev/sdf: WDC WD2500HHTZ-60N21V0: 29°C
/dev/sdg: HGST HUS726060ALA640: 27°C
/dev/sdh: HGST HUS726060ALE610: 41°C
/dev/sdi: SEAGATE ST6000NM0034: 41°C
/dev/sdj: HGST HUH728080AL4200: 36°C
/dev/sdk: HGST HUS726060ALA640: 33°C
/dev/sdl: HGST HUS726060ALE610: 33°C
/dev/sdm: SEAGATE ST6000NM0095: 31°C
/dev/sdn: SEAGATE ST6000NM0095: 32°C

sensors
drivetemp-scsi-10-10
Adapter: SCSI adapter
temp1: +41.0°C (low = +0.0°C, high = +60.0°C)
       (crit low = -40.0°C, crit = +70.0°C)
       (lowest = +29.0°C, highest = +53.0°C)

nct6792-isa-0a10
Adapter: ISA adapter
in0: 896.00 mV (min = +0.00 V, max = +1.74 V)
in1: 984.00 mV (min = +0.00 V, max = +0.00 V) ALARM
in2: 3.34 V (min = +0.00 V, max = +0.00 V) ALARM
in3: 3.34 V (min = +0.00 V, max = +0.00 V) ALARM
in4: 1000.00 mV (min = +0.00 V, max = +0.00 V) ALARM
in5: 1.26 V (min = +0.00 V, max = +0.00 V) ALARM
in6: 1.04 V (min = +0.00 V, max = +0.00 V) ALARM
in7: 3.36 V (min = +0.00 V, max = +0.00 V) ALARM
in8: 3.28 V (min = +0.00 V, max = +0.00 V) ALARM
in9: 1.04 V (min = +0.00 V, max = +0.00 V) ALARM
in10: 1.09 V (min = +0.00 V, max = +0.00 V) ALARM
in11: 1.26 V (min = +0.00 V, max = +0.00 V) ALARM
in12: 600.00 mV (min = +0.00 V, max = +0.00 V) ALARM
in13: 600.00 mV (min = +0.00 V, max = +0.00 V) ALARM
in14: 1.83 V (min = +0.00 V, max = +0.00 V) ALARM
fan1: 696 RPM (min = 0 RPM)
fan2: 0 RPM (min = 0 RPM)
fan3: 1061 RPM (min = 0 RPM)
fan4: 0 RPM (min = 0 RPM)
fan5: 791 RPM (min = 0 RPM)
SYSTIN: +40.0°C (high = +0.0°C, hyst = +0.0°C) ALARM sensor = CPU diode
CPUTIN: +44.0°C (high = +107.0°C, hyst = +102.0°C) sensor = thermistor
AUXTIN0: +24.0°C sensor = thermistor
AUXTIN1: +21.0°C sensor = thermistor
AUXTIN2: +13.0°C sensor = thermistor
AUXTIN3: -26.0°C sensor = thermistor
PECI Agent 0: +53.0°C
PCH_CHIP_CPU_MAX_TEMP: +0.0°C
PCH_CHIP_TEMP: +0.0°C
PCH_CPU_TEMP: +0.0°C
intrusion0: ALARM
intrusion1: ALARM
beep_enable: disabled

drivetemp-scsi-7-0
Adapter: SCSI adapter
temp1: +29.0°C (low = +0.0°C, high = +60.0°C)
       (crit low = -41.0°C, crit = +85.0°C)
       (lowest = +21.0°C, highest = +41.0°C)

drivetemp-scsi-10-50
Adapter: SCSI adapter
temp1: +33.0°C (low = +0.0°C, high = +60.0°C)
       (crit low = -40.0°C, crit = +70.0°C)
       (lowest = +26.0°C, highest = +47.0°C)

drivetemp-scsi-5-0
Adapter: SCSI adapter
temp1: +28.0°C (low = +0.0°C, high = +60.0°C)
       (crit low = -41.0°C, crit = +85.0°C)
       (lowest = +20.0°C, highest = +42.0°C)

drivetemp-scsi-2-0
Adapter: SCSI adapter
temp1: +21.0°C

nvme-pci-0400
Adapter: PCI adapter
Composite: +42.9°C (low = -273.1°C, high = +89.8°C)
           (crit = +94.8°C)
Sensor 1: +42.9°C (low = -273.1°C, high = +65261.8°C)
Sensor 2: +31.9°C (low = -273.1°C, high = +65261.8°C)

drivetemp-scsi-10-40
Adapter: SCSI adapter
temp1: +33.0°C (low = +0.0°C, high = +60.0°C)
       (crit low = -40.0°C, crit = +70.0°C)
       (lowest = +24.0°C, highest = +43.0°C)

drivetemp-scsi-10-0
Adapter: SCSI adapter
temp1: +27.0°C (low = +0.0°C, high = +60.0°C)
       (crit low = -40.0°C, crit = +70.0°C)
       (lowest = +20.0°C, highest = +43.0°C)

coretemp-isa-0000
Adapter: ISA adapter
Package id 0: +50.0°C (high = +77.0°C, crit = +95.0°C)
Core 0: +50.0°C (high = +77.0°C, crit = +95.0°C)
Core 1: +47.0°C (high = +77.0°C, crit = +95.0°C)
Core 4: +42.0°C (high = +77.0°C, crit = +95.0°C)
Core 5: +37.0°C (high = +77.0°C, crit = +95.0°C)

drivetemp-scsi-6-0
Adapter: SCSI adapter
temp1: +23.0°C (low = +0.0°C, high = +70.0°C)
       (crit low = +0.0°C, crit = +70.0°C)
       (lowest = +20.0°C, highest = +40.0°C)

drivetemp-scsi-4-0
Adapter: SCSI adapter
temp1: +30.0°C (low = +0.0°C, high = +60.0°C)
       (crit low = -40.0°C, crit = +70.0°C)
       (lowest = +22.0°C, highest = +37.0°C)

drivetemp-scsi-0-0
Adapter: SCSI adapter
temp1: +27.0°C (low = +0.0°C, high = +60.0°C)
       (crit low = -40.0°C, crit = +70.0°C)
       (lowest = +24.0°C, highest = +46.0°C)

desbma commented 7 months ago

You must be using an old version of the SAS branch, because there is currently no difference with master.

Can you try master at the latest commit, 202811314ac999e119779a44c0efcfc9050b6f8? I have added more logging to help understand your issue.

exodus454 commented 7 months ago

Now it's working 😂