intel / thermal_daemon

Thermal daemon for IA
GNU General Public License v2.0
535 stars 117 forks source link

thermald throttles CPU even though temperatures are fine #388

Closed jy-lefort closed 1 month ago

jy-lefort commented 1 year ago

I'm running Fedora 37, where thermald is installed by default.

At some point in January 2023 (likely after a dnf update), my C++ projects started compiling much slower than before (about twice as slow) and the machine would be unusually cold during a build. The culprit was thermald, which was throttling the CPU for no reason.

Running "dnf remove thermald" solved the issue.

My CPU is a i7-1280P.

spandruvada commented 1 year ago

Something triggers throttling. Some sensor was calling for throttling as defined by configuration, othewise thermald doesn't do throttle. Please attach logs to check:

systemctl stop thermald sudo thermald --no-daemon --loglevel=info --adaptive Then trigger the condition and attach the generated log.

jy-lefort commented 1 year ago

thermald.log

That's the result of running a parallel build. It seems to me that it throttles at 85°C:

[1676655320][INFO]Set : threshold:85000, temperature:94000, cdev:24(intel_pstate), curr_state:1, max_state:10

In that case it disables turbo (/sys/devices/system/cpu/intel_pstate/no_turbo becomes 1) and decreases max_perf_pct from 100% to as low as 70% (even 50% sometimes). Quite obviously, that has a catastrophic impact on performance.

I don't want thermald to do that, it didn't do that before the update. Without thermald, the CPU can get as hot as 100°C without any issues (100°C seems to be a kernel or CPU limit).

To summarize, thermald has to work out of the box (without having to tweak .xml files or downright uninstall it). It worked out of the box before.

spandruvada commented 1 year ago

This is because your system's thermal table has a bug, which didn't specify a target for balanced condition. Hence default limit kicked in where it calls to keep temp at 85C. So get throttled to reach that limit.

This need some workaround.

spandruvada commented 1 year ago

Please try to build from branch issue_388 and reproduce. Build procedure is in README.txt. https://github.com/intel/thermal_daemon/tree/issue_388

jy-lefort commented 1 year ago

Thanks for the quick feedback. Sorry but as removing thermald works for me, I don't really have time nor inclination to start building branches and so on.

By "my system's thermal table", do you mean the default configuration of thermald as shipped by Fedora? If it's the case, I can open a bug on the Fedora bug tracker and link to this report.

If, on the other hand, you suspect it is an issue with thermald itself, maybe Intel Corporation can contract me to fix this issue? As you can see in the log, my CPU is recent (12th gen i7) and it's not very nice that to novice users, it appears twice as slow as competing AMD CPUs.

spandruvada commented 1 year ago

The thermal configuration is from the manufacturer of your laptop. This is nothing to do with thermald or Fedora. Intel doesn't define thermal tables.

spandruvada commented 1 year ago

BTW, what is the make and model of this latop?

spandruvada commented 1 year ago

Also please file a bug report with Fedora. Link this issue, so that they can build a version.

jy-lefort commented 1 year ago

My laptop is a MSI Prestige 15.

Could it be a hardware issue such as a faulty temp sensor? It didn't do this initially, so I'm wondering.

Thanks for the details.

spandruvada commented 1 year ago

No. There is no HW issue. Probably there was some BIOS update which may have brought new tables. Thanks for reporting. I tested by simulating such table. So I will release it soon.

jy-lefort commented 1 year ago

Thanks for the very quick feedback. I don't know much about modern BIOSes, all I know is whether I shall use a constref or a rvalue ref in C++. Out of curiousity I'll probably investigate furthermore.

lnicola commented 1 year ago

I might be having the same issue (CC https://github.com/intel/thermal_daemon/issues/280). I have an LG Gram (i7-1260P, ADL), which ends up running at 400 MHz at random times, apparently unrelated to the system load or temperatures.

What I found in my investigation up to this point:

image

thermald log ``` [1679865920][INFO]RAPL domain count 0 [1679865920][INFO]RAPL domain count 1 [1679865920][MSG]32 CPUID levels; family:model:stepping 0x6:9a:3 (6:154:3) [1679865920][INFO]THD engine init failed [1679865920][INFO]--adaptive option failed on this platform [1679865920][INFO]Ignoring --adaptive option [1679865920][INFO]RAPL domain count 0 [1679865920][INFO]RAPL domain count 1 [1679865920][MSG]32 CPUID levels; family:model:stepping 0x6:9a:3 (6:154:3) [1679865920][INFO]sensor_update: type TCPU [1679865920][INFO]sensor_update: type acpitz [1679865920][INFO]sensor_update: type iwlwifi_1 [1679865920][INFO]sensor_update: type TCPU_PCI [1679865920][INFO]sensor_update: type INT3400 [1679865920][INFO]sensor_update: type x86_pkg_temp [1679865920][INFO]thd_read_default_thermal_sensors loaded 6 sensors [1679865920][INFO]dts /sys/devices/platform/coretemp.0/name doesn't exist [1679865920][MSG]sensor id 19 : No temp sysfs for reading raw temp [1679865920][MSG]sensor id 19 : No temp sysfs for reading raw temp [1679865920][MSG]sensor id 19 : No temp sysfs for reading raw temp [1679865920][INFO]INT3400 Base path is /sys/bus/acpi/devices/INTC1041:00/physical_node/uuids/ [1679865920][INFO]Passive 1 UUID is not present, hence ignore _TRT, as it may have junk!! [1679865920][MSG]Config file /etc/thermald/thermal-conf.xml does not exist [1679865920][INFO]sensor index:2 TCPU /sys/class/thermal/thermal_zone2/ Async:0 [1679865920][INFO]sensor index:0 acpitz /sys/class/thermal/thermal_zone0/ Async:0 [1679865920][INFO]sensor index:5 iwlwifi_1 /sys/class/thermal/thermal_zone5/ Async:0 [1679865920][INFO]sensor index:3 TCPU_PCI /sys/class/thermal/thermal_zone3/ Async:0 [1679865920][INFO]sensor index:1 INT3400 /sys/class/thermal/thermal_zone1/ Async:0 [1679865920][INFO]sensor index:4 x86_pkg_temp /sys/class/thermal/thermal_zone4/ Async:1 [1679865920][INFO]sensor index:6 hwmon /sys/class/hwmon/hwmon5/temp6_input Async:0 [1679865920][INFO]sensor index:7 hwmon /sys/class/hwmon/hwmon5/temp13_input Async:0 [1679865920][INFO]sensor index:8 hwmon /sys/class/hwmon/hwmon5/temp3_input Async:0 [1679865920][INFO]sensor index:9 hwmon /sys/class/hwmon/hwmon5/temp10_input Async:0 [1679865920][INFO]sensor index:10 hwmon /sys/class/hwmon/hwmon5/temp7_input Async:0 [1679865920][INFO]sensor index:11 hwmon /sys/class/hwmon/hwmon5/temp4_input Async:0 [1679865920][INFO]sensor index:12 hwmon /sys/class/hwmon/hwmon5/temp11_input Async:0 [1679865920][INFO]sensor index:13 hwmon /sys/class/hwmon/hwmon5/temp8_input Async:0 [1679865920][INFO]sensor index:14 hwmon /sys/class/hwmon/hwmon5/temp1_input Async:0 [1679865920][INFO]sensor index:15 hwmon /sys/class/hwmon/hwmon5/temp5_input Async:0 [1679865920][INFO]sensor index:16 hwmon /sys/class/hwmon/hwmon5/temp12_input Async:0 [1679865920][INFO]sensor index:17 hwmon /sys/class/hwmon/hwmon5/temp9_input Async:0 [1679865920][INFO]sensor index:18 hwmon /sys/class/hwmon/hwmon5/temp2_input Async:0 [1679865920][INFO]thd_read_default_cooling devices loaded 18 cdevs [1679865920][INFO]ppcc limits max:28000000 min:125000 min_win:28000000 step:500000 [1679865920][INFO]set_pid_param 18 [-1000.100,10] [1679865920][INFO]Use Default pstate drv settings [1679865920][INFO]name = package-0 [1679865920][INFO]name = core [1679865920][INFO]name = uncore [1679865920][INFO]INT3400 Base path is /sys/bus/acpi/devices/INTC1041:00/physical_node/uuids/ [1679865920][INFO]Passive 1 UUID is not present, hence ignore _TRT, as it may have junk!! [1679865920][MSG]Config file /etc/thermald/thermal-conf.xml does not exist [1679865920][INFO]13: Processor, C:0 MN: 0 MX:0 ST:1 pt:/sys/class/thermal/ rd_bk 0 [1679865920][INFO]1: Processor, C:0 MN: 0 MX:0 ST:1 pt:/sys/class/thermal/ rd_bk 0 [1679865920][INFO]11: Processor, C:0 MN: 0 MX:0 ST:1 pt:/sys/class/thermal/ rd_bk 0 [1679865920][INFO]8: Processor, C:0 MN: 0 MX:0 ST:1 pt:/sys/class/thermal/ rd_bk 0 [1679865920][INFO]6: Processor, C:0 MN: 0 MX:0 ST:1 pt:/sys/class/thermal/ rd_bk 0 [1679865920][INFO]16: intel_powerclamp, C:-1 MN: 0 MX:50 ST:5 pt:/sys/class/thermal/ rd_bk 0 [1679865920][INFO]4: Processor, C:0 MN: 0 MX:0 ST:1 pt:/sys/class/thermal/ rd_bk 0 [1679865920][INFO]14: Processor, C:0 MN: 0 MX:0 ST:1 pt:/sys/class/thermal/ rd_bk 0 [1679865920][INFO]2: Processor, C:0 MN: 0 MX:0 ST:1 pt:/sys/class/thermal/ rd_bk 0 [1679865920][INFO]12: Processor, C:0 MN: 0 MX:0 ST:1 pt:/sys/class/thermal/ rd_bk 0 [1679865920][INFO]0: Processor, C:0 MN: 0 MX:0 ST:1 pt:/sys/class/thermal/ rd_bk 0 [1679865920][INFO]10: Processor, C:0 MN: 0 MX:0 ST:1 pt:/sys/class/thermal/ rd_bk 0 [1679865920][INFO]9: Processor, C:0 MN: 0 MX:0 ST:1 pt:/sys/class/thermal/ rd_bk 0 [1679865920][INFO]7: Processor, C:0 MN: 0 MX:0 ST:1 pt:/sys/class/thermal/ rd_bk 0 [1679865920][INFO]17: TCC, C:10 MN: 0 MX:63 ST:1 pt:/sys/class/thermal/ rd_bk 1 [1679865920][INFO]5: Processor, C:0 MN: 0 MX:0 ST:1 pt:/sys/class/thermal/ rd_bk 0 [1679865920][INFO]15: Processor, C:0 MN: 0 MX:0 ST:1 pt:/sys/class/thermal/ rd_bk 0 [1679865920][INFO]3: Processor, C:0 MN: 0 MX:0 ST:1 pt:/sys/class/thermal/ rd_bk 0 [1679865920][INFO]18: rapl_controller, C:28000000 MN: 28000000 MX:125000 Inc ST:-1000000 Dec ST:-500000 pt:/sys/devices/virtual/powercap/intel-rapl/intel-rapl:0/ rd_bk 1 [1679865920][INFO]19: intel_pstate, C:0 MN: 0 MX:10 ST:1 pt:/sys/devices/system/cpu/intel_pstate/ rd_bk 1 [1679865920][INFO]20: LCD, C:0 MN: 0 MX:96000 ST:9600 pt:/sys/class/backlight/intel_backlight/ rd_bk 1 [1679865920][INFO]thd_read_default_thermal_zones loaded 6 zones [1679865920][INFO]INT3400 Base path is /sys/bus/acpi/devices/INTC1041:00/physical_node/uuids/ [1679865920][INFO]Processor thermal device is present [1679865920][INFO]It will act as CPU thermal zone !! [1679865920][INFO]Processor thermal device passive Trip is 90000 [1679865920][INFO]min:0 max:0 [1679865920][INFO]min:0 max:0 [1679865920][INFO]min:0 max:0 [1679865920][INFO]min:0 max:0 [1679865920][INFO]INT3400 Base path is /sys/bus/acpi/devices/INTC1041:00/physical_node/uuids/ [1679865920][INFO]Passive 1 UUID is not present, hence ignore _TRT, as it may have junk!! [1679865920][MSG]Config file /etc/thermald/thermal-conf.xml does not exist [1679865920][INFO] ZONE DUMP BEGIN [1679865920][INFO] [1679865920][INFO]Zone 2: TCPU, Active:1 Bind:1 Sensor_cnt:1 [1679865920][INFO]..sensors.. [1679865920][INFO]sensor index:2 TCPU /sys/class/thermal/thermal_zone2/ Async:0 [1679865920][INFO]..trips.. [1679865920][INFO]index 7: type:active temp:103050 hyst:1000 zone id:2 sensor id:2 control_type:1 cdev size:0 [1679865920][INFO]index 6: type:active temp:104550 hyst:1000 zone id:2 sensor id:2 control_type:1 cdev size:0 [1679865920][INFO]index 5: type:active temp:106050 hyst:1000 zone id:2 sensor id:2 control_type:1 cdev size:0 [1679865920][INFO]index 4: type:active temp:107050 hyst:1000 zone id:2 sensor id:2 control_type:1 cdev size:0 [1679865920][INFO]index 3: type:active temp:109050 hyst:1000 zone id:2 sensor id:2 control_type:1 cdev size:0 [1679865920][INFO]index 0: type:critical temp:110050 hyst:1000 zone id:2 sensor id:2 control_type:1 cdev size:0 [1679865920][INFO]index 1: type:max temp:110050 hyst:1000 zone id:2 sensor id:2 control_type:1 cdev size:0 [1679865920][INFO]index 2: type:passive temp:90000 hyst:1000 zone id:2 sensor id:2 control_type:1 cdev size:4 [1679865920][INFO]cdev[0] rapl_controller, Sampling period: 0 [1679865920][INFO] target_state:not defined [1679865920][INFO]min_max 0 [1679865920][INFO]cdev[1] intel_pstate, Sampling period: 0 [1679865920][INFO] target_state:not defined [1679865920][INFO]min_max 0 [1679865920][INFO]cdev[2] intel_powerclamp, Sampling period: 0 [1679865920][INFO] target_state:not defined [1679865920][INFO]min_max 0 [1679865920][INFO]cdev[3] Processor, Sampling period: 0 [1679865920][INFO] target_state:not defined [1679865920][INFO]min_max 0 [1679865920][INFO]index 8: type:polling temp:92745 hyst:0 zone id:2 sensor id:2 control_type:0 cdev size:0 [1679865920][INFO] [1679865920][INFO] ZONE DUMP END [1679865920][INFO]Running on a vanilla kernel [1679865920][MSG]Polling mode is enabled: 4 [1679865920][INFO]Current user preference is 0 [1679865920][INFO]thd_engine_thread begin [1679865924][INFO]op->device:Processor -1 [1679865924][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679865924][INFO]Set : threshold:90000, temperature:36050, cdev:13(Processor), curr_state:-1, max_state:0 [1679865928][INFO]op->device:Processor -2 [1679865928][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679865928][INFO]Set : threshold:90000, temperature:36050, cdev:13(Processor), curr_state:-2, max_state:0 [1679865932][INFO]op->device:Processor -3 [1679865932][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679865932][INFO]Set : threshold:90000, temperature:36050, cdev:13(Processor), curr_state:-3, max_state:0 [1679865936][INFO]op->device:Processor -4 [1679865936][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679865936][INFO]Set : threshold:90000, temperature:36050, cdev:13(Processor), curr_state:-4, max_state:0 [1679865940][INFO]op->device:Processor -5 [1679865940][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679865940][INFO]Set : threshold:90000, temperature:36050, cdev:13(Processor), curr_state:-5, max_state:0 [1679865944][INFO]op->device:Processor -6 [1679865944][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679865944][INFO]Set : threshold:90000, temperature:36050, cdev:13(Processor), curr_state:-6, max_state:0 [1679865948][INFO]op->device:Processor -7 [1679865948][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679865948][INFO]Set : threshold:90000, temperature:36050, cdev:13(Processor), curr_state:-7, max_state:0 [1679865952][INFO]op->device:Processor -8 [1679865952][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679865952][INFO]Set : threshold:90000, temperature:36050, cdev:13(Processor), curr_state:-8, max_state:0 [1679865956][INFO]op->device:Processor -9 [1679865956][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679865956][INFO]Set : threshold:90000, temperature:36050, cdev:13(Processor), curr_state:-9, max_state:0 [1679865960][INFO]op->device:Processor -10 [1679865960][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679865960][INFO]Set : threshold:90000, temperature:36050, cdev:13(Processor), curr_state:-10, max_state:0 [1679865964][INFO]op->device:Processor -11 [1679865964][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679865964][INFO]Set : threshold:90000, temperature:36050, cdev:13(Processor), curr_state:-11, max_state:0 [1679865968][INFO]op->device:Processor -12 [1679865968][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679865968][INFO]Set : threshold:90000, temperature:36050, cdev:13(Processor), curr_state:-12, max_state:0 [1679865972][INFO]op->device:Processor -13 [1679865972][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679865972][INFO]Set : threshold:90000, temperature:37050, cdev:13(Processor), curr_state:-13, max_state:0 [1679865976][INFO]op->device:Processor -14 [1679865976][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679865976][INFO]Set : threshold:90000, temperature:37050, cdev:13(Processor), curr_state:-14, max_state:0 [1679865980][INFO]op->device:Processor -15 [1679865980][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679865980][INFO]Set : threshold:90000, temperature:36050, cdev:13(Processor), curr_state:-15, max_state:0 [1679865984][INFO]op->device:Processor -16 [1679865984][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679865984][INFO]Set : threshold:90000, temperature:36050, cdev:13(Processor), curr_state:-16, max_state:0 [1679865988][INFO]op->device:Processor -17 [1679865988][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679865988][INFO]Set : threshold:90000, temperature:36050, cdev:13(Processor), curr_state:-17, max_state:0 [1679865992][INFO]op->device:Processor -18 [1679865992][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679865992][INFO]Set : threshold:90000, temperature:36050, cdev:13(Processor), curr_state:-18, max_state:0 [1679865996][INFO]op->device:Processor -19 [1679865996][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679865996][INFO]Set : threshold:90000, temperature:36050, cdev:13(Processor), curr_state:-19, max_state:0 [1679866000][INFO]op->device:Processor -20 [1679866000][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866000][INFO]Set : threshold:90000, temperature:35050, cdev:13(Processor), curr_state:-20, max_state:0 [1679866004][INFO]op->device:Processor -21 [1679866004][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866004][INFO]Set : threshold:90000, temperature:37050, cdev:13(Processor), curr_state:-21, max_state:0 [1679866008][INFO]op->device:Processor -22 [1679866008][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866008][INFO]Set : threshold:90000, temperature:37050, cdev:13(Processor), curr_state:-22, max_state:0 [1679866012][INFO]op->device:Processor -23 [1679866012][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866012][INFO]Set : threshold:90000, temperature:37050, cdev:13(Processor), curr_state:-23, max_state:0 [1679866016][INFO]op->device:Processor -24 [1679866016][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866016][INFO]Set : threshold:90000, temperature:37050, cdev:13(Processor), curr_state:-24, max_state:0 [1679866020][INFO]op->device:Processor -25 [1679866020][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866020][INFO]Set : threshold:90000, temperature:37050, cdev:13(Processor), curr_state:-25, max_state:0 [1679866024][INFO]op->device:Processor -26 [1679866024][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866024][INFO]Set : threshold:90000, temperature:37050, cdev:13(Processor), curr_state:-26, max_state:0 [1679866028][INFO]op->device:Processor -27 [1679866028][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866028][INFO]Set : threshold:90000, temperature:37050, cdev:13(Processor), curr_state:-27, max_state:0 [1679866032][INFO]op->device:Processor -28 [1679866032][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866032][INFO]Set : threshold:90000, temperature:39050, cdev:13(Processor), curr_state:-28, max_state:0 [1679866036][INFO]op->device:Processor -29 [1679866036][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866036][INFO]Set : threshold:90000, temperature:37050, cdev:13(Processor), curr_state:-29, max_state:0 [1679866040][INFO]op->device:Processor -30 [1679866040][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866040][INFO]Set : threshold:90000, temperature:37050, cdev:13(Processor), curr_state:-30, max_state:0 [1679866044][INFO]op->device:Processor -31 [1679866044][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866044][INFO]Set : threshold:90000, temperature:37050, cdev:13(Processor), curr_state:-31, max_state:0 [1679866048][INFO]op->device:Processor -32 [1679866048][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866048][INFO]Set : threshold:90000, temperature:35050, cdev:13(Processor), curr_state:-32, max_state:0 [1679866052][INFO]op->device:Processor -33 [1679866052][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866052][INFO]Set : threshold:90000, temperature:36050, cdev:13(Processor), curr_state:-33, max_state:0 [1679866056][INFO]op->device:Processor -34 [1679866056][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866056][INFO]Set : threshold:90000, temperature:36050, cdev:13(Processor), curr_state:-34, max_state:0 [1679866060][INFO]op->device:Processor -35 [1679866060][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866060][INFO]Set : threshold:90000, temperature:35050, cdev:13(Processor), curr_state:-35, max_state:0 [1679866064][INFO]op->device:Processor -36 [1679866064][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866064][INFO]Set : threshold:90000, temperature:35050, cdev:13(Processor), curr_state:-36, max_state:0 [1679866068][INFO]op->device:Processor -37 [1679866068][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866068][INFO]Set : threshold:90000, temperature:35050, cdev:13(Processor), curr_state:-37, max_state:0 [1679866072][INFO]op->device:Processor -38 [1679866072][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866072][INFO]Set : threshold:90000, temperature:36050, cdev:13(Processor), curr_state:-38, max_state:0 [1679866076][INFO]op->device:Processor -39 [1679866076][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866076][INFO]Set : threshold:90000, temperature:37050, cdev:13(Processor), curr_state:-39, max_state:0 [1679866080][INFO]op->device:Processor -40 [1679866080][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866080][INFO]Set : threshold:90000, temperature:36050, cdev:13(Processor), curr_state:-40, max_state:0 [1679866084][INFO]op->device:Processor -41 [1679866084][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866084][INFO]Set : threshold:90000, temperature:35050, cdev:13(Processor), curr_state:-41, max_state:0 [1679866088][INFO]op->device:Processor -42 [1679866088][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866088][INFO]Set : threshold:90000, temperature:35050, cdev:13(Processor), curr_state:-42, max_state:0 [1679866092][INFO]op->device:Processor -43 [1679866092][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866092][INFO]Set : threshold:90000, temperature:36050, cdev:13(Processor), curr_state:-43, max_state:0 [1679866096][INFO]op->device:Processor -44 [1679866096][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866096][INFO]Set : threshold:90000, temperature:35050, cdev:13(Processor), curr_state:-44, max_state:0 [1679866100][INFO]op->device:Processor -45 [1679866100][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866100][INFO]Set : threshold:90000, temperature:36050, cdev:13(Processor), curr_state:-45, max_state:0 [1679866104][INFO]op->device:Processor -46 [1679866104][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866104][INFO]Set : threshold:90000, temperature:35050, cdev:13(Processor), curr_state:-46, max_state:0 [1679866108][INFO]op->device:Processor -47 [1679866108][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866108][INFO]Set : threshold:90000, temperature:35050, cdev:13(Processor), curr_state:-47, max_state:0 [1679866112][INFO]op->device:Processor -48 [1679866112][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866112][INFO]Set : threshold:90000, temperature:36050, cdev:13(Processor), curr_state:-48, max_state:0 [1679866116][INFO]op->device:Processor -49 [1679866116][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866116][INFO]Set : threshold:90000, temperature:36050, cdev:13(Processor), curr_state:-49, max_state:0 [1679866120][INFO]op->device:Processor -50 [1679866120][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866120][INFO]Set : threshold:90000, temperature:35050, cdev:13(Processor), curr_state:-50, max_state:0 [1679866124][INFO]op->device:Processor -51 [1679866124][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866124][INFO]Set : threshold:90000, temperature:35050, cdev:13(Processor), curr_state:-51, max_state:0 [1679866128][INFO]op->device:Processor -52 [1679866128][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866128][INFO]Set : threshold:90000, temperature:36050, cdev:13(Processor), curr_state:-52, max_state:0 [1679866132][INFO]op->device:Processor -53 [1679866132][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866132][INFO]Set : threshold:90000, temperature:36050, cdev:13(Processor), curr_state:-53, max_state:0 [1679866136][INFO]op->device:Processor -54 [1679866136][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866136][INFO]Set : threshold:90000, temperature:35050, cdev:13(Processor), curr_state:-54, max_state:0 [1679866140][INFO]op->device:Processor -55 [1679866140][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866140][INFO]Set : threshold:90000, temperature:36050, cdev:13(Processor), curr_state:-55, max_state:0 [1679866144][INFO]op->device:Processor -56 [1679866144][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866144][INFO]Set : threshold:90000, temperature:36050, cdev:13(Processor), curr_state:-56, max_state:0 [1679866148][INFO]op->device:Processor -57 [1679866148][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866148][INFO]Set : threshold:90000, temperature:35050, cdev:13(Processor), curr_state:-57, max_state:0 [1679866152][INFO]op->device:Processor -58 [1679866152][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866152][INFO]Set : threshold:90000, temperature:34050, cdev:13(Processor), curr_state:-58, max_state:0 [1679866156][INFO]op->device:Processor -59 [1679866156][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866156][INFO]Set : threshold:90000, temperature:35050, cdev:13(Processor), curr_state:-59, max_state:0 [1679866160][INFO]op->device:Processor -60 [1679866160][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866160][INFO]Set : threshold:90000, temperature:36050, cdev:13(Processor), curr_state:-60, max_state:0 [1679866164][INFO]op->device:Processor -61 [1679866164][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866164][INFO]Set : threshold:90000, temperature:35050, cdev:13(Processor), curr_state:-61, max_state:0 [1679866168][INFO]op->device:Processor -62 [1679866168][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866168][INFO]Set : threshold:90000, temperature:36050, cdev:13(Processor), curr_state:-62, max_state:0 [1679866172][INFO]op->device:Processor -63 [1679866172][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866172][INFO]Set : threshold:90000, temperature:37050, cdev:13(Processor), curr_state:-63, max_state:0 [1679866176][INFO]op->device:Processor -64 [1679866176][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866176][INFO]Set : threshold:90000, temperature:37050, cdev:13(Processor), curr_state:-64, max_state:0 [1679866180][INFO]op->device:Processor -65 [1679866180][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866180][INFO]Set : threshold:90000, temperature:37050, cdev:13(Processor), curr_state:-65, max_state:0 [1679866184][INFO]op->device:Processor -66 [1679866184][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866184][INFO]Set : threshold:90000, temperature:37050, cdev:13(Processor), curr_state:-66, max_state:0 [1679866188][INFO]op->device:Processor -67 [1679866188][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866188][INFO]Set : threshold:90000, temperature:37050, cdev:13(Processor), curr_state:-67, max_state:0 [1679866192][INFO]op->device:Processor -68 [1679866192][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866192][INFO]Set : threshold:90000, temperature:37050, cdev:13(Processor), curr_state:-68, max_state:0 [1679866196][INFO]op->device:Processor -69 [1679866196][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866196][INFO]Set : threshold:90000, temperature:37050, cdev:13(Processor), curr_state:-69, max_state:0 [1679866200][INFO]op->device:Processor -70 [1679866200][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866200][INFO]Set : threshold:90000, temperature:36050, cdev:13(Processor), curr_state:-70, max_state:0 [1679866204][INFO]op->device:Processor -71 [1679866204][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866204][INFO]Set : threshold:90000, temperature:35050, cdev:13(Processor), curr_state:-71, max_state:0 [1679866208][INFO]op->device:Processor -72 [1679866208][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866208][INFO]Set : threshold:90000, temperature:37050, cdev:13(Processor), curr_state:-72, max_state:0 [1679866212][INFO]op->device:Processor -73 [1679866212][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866212][INFO]Set : threshold:90000, temperature:36050, cdev:13(Processor), curr_state:-73, max_state:0 [1679866216][INFO]op->device:Processor -74 [1679866216][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866216][INFO]Set : threshold:90000, temperature:35050, cdev:13(Processor), curr_state:-74, max_state:0 [1679866220][INFO]op->device:Processor -75 [1679866220][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866220][INFO]Set : threshold:90000, temperature:36050, cdev:13(Processor), curr_state:-75, max_state:0 [1679866224][INFO]op->device:Processor -76 [1679866224][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866224][INFO]Set : threshold:90000, temperature:36050, cdev:13(Processor), curr_state:-76, max_state:0 [1679866228][INFO]op->device:Processor -77 [1679866228][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866228][INFO]Set : threshold:90000, temperature:36050, cdev:13(Processor), curr_state:-77, max_state:0 [1679866232][INFO]op->device:Processor -78 [1679866232][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866232][INFO]Set : threshold:90000, temperature:35050, cdev:13(Processor), curr_state:-78, max_state:0 [1679866236][INFO]op->device:Processor -79 [1679866236][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866236][INFO]Set : threshold:90000, temperature:36050, cdev:13(Processor), curr_state:-79, max_state:0 [1679866240][INFO]op->device:Processor -80 [1679866240][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866240][INFO]Set : threshold:90000, temperature:35050, cdev:13(Processor), curr_state:-80, max_state:0 [1679866244][INFO]op->device:Processor -81 [1679866244][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866244][INFO]Set : threshold:90000, temperature:36050, cdev:13(Processor), curr_state:-81, max_state:0 [1679866248][INFO]op->device:Processor -82 [1679866248][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866248][INFO]Set : threshold:90000, temperature:35050, cdev:13(Processor), curr_state:-82, max_state:0 [1679866252][INFO]op->device:Processor -83 [1679866252][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866252][INFO]Set : threshold:90000, temperature:39050, cdev:13(Processor), curr_state:-83, max_state:0 [1679866256][INFO]op->device:Processor -84 [1679866256][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866256][INFO]Set : threshold:90000, temperature:38050, cdev:13(Processor), curr_state:-84, max_state:0 [1679866260][INFO]op->device:Processor -85 [1679866260][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866260][INFO]Set : threshold:90000, temperature:40050, cdev:13(Processor), curr_state:-85, max_state:0 [1679866264][INFO]op->device:Processor -86 [1679866264][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866264][INFO]Set : threshold:90000, temperature:42050, cdev:13(Processor), curr_state:-86, max_state:0 [1679866269][INFO]op->device:Processor -87 [1679866269][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866269][INFO]Set : threshold:90000, temperature:41050, cdev:13(Processor), curr_state:-87, max_state:0 [1679866273][INFO]op->device:Processor -88 [1679866273][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866273][INFO]Set : threshold:90000, temperature:40050, cdev:13(Processor), curr_state:-88, max_state:0 [1679866277][INFO]op->device:Processor -89 [1679866277][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866277][INFO]Set : threshold:90000, temperature:38050, cdev:13(Processor), curr_state:-89, max_state:0 [1679866281][INFO]op->device:Processor -90 [1679866281][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866281][INFO]Set : threshold:90000, temperature:38050, cdev:13(Processor), curr_state:-90, max_state:0 [1679866285][INFO]op->device:Processor -91 [1679866285][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866285][INFO]Set : threshold:90000, temperature:38050, cdev:13(Processor), curr_state:-91, max_state:0 [1679866289][INFO]op->device:Processor -92 [1679866289][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866289][INFO]Set : threshold:90000, temperature:38050, cdev:13(Processor), curr_state:-92, max_state:0 [1679866293][INFO]op->device:Processor -93 [1679866293][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866293][INFO]Set : threshold:90000, temperature:39050, cdev:13(Processor), curr_state:-93, max_state:0 [1679866297][INFO]op->device:Processor -94 [1679866297][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866297][INFO]Set : threshold:90000, temperature:38050, cdev:13(Processor), curr_state:-94, max_state:0 [1679866301][INFO]op->device:Processor -95 [1679866301][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866301][INFO]Set : threshold:90000, temperature:53050, cdev:13(Processor), curr_state:-95, max_state:0 [1679866305][INFO]op->device:Processor -96 [1679866305][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866305][INFO]Set : threshold:90000, temperature:53050, cdev:13(Processor), curr_state:-96, max_state:0 [1679866309][INFO]op->device:Processor -97 [1679866309][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866309][INFO]Set : threshold:90000, temperature:40050, cdev:13(Processor), curr_state:-97, max_state:0 [1679866313][INFO]op->device:Processor -98 [1679866313][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866313][INFO]Set : threshold:90000, temperature:40050, cdev:13(Processor), curr_state:-98, max_state:0 [1679866317][INFO]op->device:Processor -99 [1679866317][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866317][INFO]Set : threshold:90000, temperature:75050, cdev:13(Processor), curr_state:-99, max_state:0 [1679866321][INFO]op->device:Processor -100 [1679866321][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866321][INFO]Set : threshold:90000, temperature:45050, cdev:13(Processor), curr_state:-100, max_state:0 [1679866325][INFO]op->device:Processor -101 [1679866325][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866325][INFO]Set : threshold:90000, temperature:46050, cdev:13(Processor), curr_state:-101, max_state:0 [1679866329][INFO]op->device:Processor -102 [1679866329][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866329][INFO]Set : threshold:90000, temperature:47050, cdev:13(Processor), curr_state:-102, max_state:0 [1679866333][INFO]op->device:Processor -103 [1679866333][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866333][INFO]Set : threshold:90000, temperature:47050, cdev:13(Processor), curr_state:-103, max_state:0 [1679866337][INFO]op->device:Processor -104 [1679866337][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866337][INFO]Set : threshold:90000, temperature:49050, cdev:13(Processor), curr_state:-104, max_state:0 [1679866341][INFO]op->device:Processor -105 [1679866341][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866341][INFO]Set : threshold:90000, temperature:56050, cdev:13(Processor), curr_state:-105, max_state:0 [1679866345][INFO]Added zone 2 trip 2 clamp_valid 0 clamp 0 _min:0 _max:0 [1679866345][INFO]set cdev state index 18 state 11712000 wr:11712000 [1679866345][INFO]Set : threshold:90000, temperature:90050, cdev:18(rapl_controller), curr_state:11712000, max_state:125000 [1679866349][INFO]op->device:Processor -106 [1679866349][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866349][INFO]Set : threshold:90000, temperature:53050, cdev:13(Processor), curr_state:-106, max_state:0 [1679866353][INFO]op->device:Processor -107 [1679866353][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866353][INFO]Set : threshold:90000, temperature:51050, cdev:13(Processor), curr_state:-107, max_state:0 [1679866357][INFO]op->device:Processor -108 [1679866357][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866357][INFO]Set : threshold:90000, temperature:51050, cdev:13(Processor), curr_state:-108, max_state:0 [1679866361][INFO]op->device:Processor -109 [1679866361][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866361][INFO]Set : threshold:90000, temperature:51050, cdev:13(Processor), curr_state:-109, max_state:0 [1679866365][INFO]op->device:Processor -110 [1679866365][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866365][INFO]Set : threshold:90000, temperature:51050, cdev:13(Processor), curr_state:-110, max_state:0 [1679866369][INFO]op->device:Processor -111 [1679866369][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866369][INFO]Set : threshold:90000, temperature:51050, cdev:13(Processor), curr_state:-111, max_state:0 [1679866373][INFO]op->device:Processor -112 [1679866373][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866373][INFO]Set : threshold:90000, temperature:49050, cdev:13(Processor), curr_state:-112, max_state:0 [1679866377][INFO]op->device:Processor -113 [1679866377][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866377][INFO]Set : threshold:90000, temperature:49050, cdev:13(Processor), curr_state:-113, max_state:0 [1679866381][INFO]op->device:Processor -114 [1679866381][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866381][INFO]Set : threshold:90000, temperature:51050, cdev:13(Processor), curr_state:-114, max_state:0 [1679866385][INFO]op->device:Processor -115 [1679866385][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866385][INFO]Set : threshold:90000, temperature:48050, cdev:13(Processor), curr_state:-115, max_state:0 [1679866389][INFO]op->device:Processor -116 [1679866389][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866389][INFO]Set : threshold:90000, temperature:54050, cdev:13(Processor), curr_state:-116, max_state:0 [1679866393][INFO]op->device:Processor -117 [1679866393][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866393][INFO]Set : threshold:90000, temperature:52050, cdev:13(Processor), curr_state:-117, max_state:0 [1679866397][INFO]op->device:Processor -118 [1679866397][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866397][INFO]Set : threshold:90000, temperature:51050, cdev:13(Processor), curr_state:-118, max_state:0 [1679866401][INFO]op->device:Processor -119 [1679866401][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866401][INFO]Set : threshold:90000, temperature:51050, cdev:13(Processor), curr_state:-119, max_state:0 [1679866405][INFO]op->device:Processor -120 [1679866405][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866405][INFO]Set : threshold:90000, temperature:49050, cdev:13(Processor), curr_state:-120, max_state:0 [1679866409][INFO]op->device:Processor -121 [1679866409][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866409][INFO]Set : threshold:90000, temperature:49050, cdev:13(Processor), curr_state:-121, max_state:0 [1679866413][INFO]op->device:Processor -122 [1679866413][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866413][INFO]Set : threshold:90000, temperature:48050, cdev:13(Processor), curr_state:-122, max_state:0 [1679866417][INFO]op->device:Processor -123 [1679866417][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866417][INFO]Set : threshold:90000, temperature:48050, cdev:13(Processor), curr_state:-123, max_state:0 [1679866421][INFO]op->device:Processor -124 [1679866421][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866421][INFO]Set : threshold:90000, temperature:47050, cdev:13(Processor), curr_state:-124, max_state:0 [1679866425][INFO]op->device:Processor -125 [1679866425][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866425][INFO]Set : threshold:90000, temperature:53050, cdev:13(Processor), curr_state:-125, max_state:0 [1679866429][INFO]op->device:Processor -126 [1679866429][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866429][INFO]Set : threshold:90000, temperature:47050, cdev:13(Processor), curr_state:-126, max_state:0 [1679866433][INFO]op->device:Processor -127 [1679866433][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866433][INFO]Set : threshold:90000, temperature:46050, cdev:13(Processor), curr_state:-127, max_state:0 [1679866437][INFO]op->device:Processor -128 [1679866437][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866437][INFO]Set : threshold:90000, temperature:47050, cdev:13(Processor), curr_state:-128, max_state:0 [1679866441][INFO]op->device:Processor -129 [1679866441][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866441][INFO]Set : threshold:90000, temperature:46050, cdev:13(Processor), curr_state:-129, max_state:0 [1679866445][INFO]op->device:Processor -130 [1679866445][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866445][INFO]Set : threshold:90000, temperature:46050, cdev:13(Processor), curr_state:-130, max_state:0 [1679866449][INFO]op->device:Processor -131 [1679866449][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866449][INFO]Set : threshold:90000, temperature:46050, cdev:13(Processor), curr_state:-131, max_state:0 [1679866453][INFO]op->device:Processor -132 [1679866453][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866453][INFO]Set : threshold:90000, temperature:46050, cdev:13(Processor), curr_state:-132, max_state:0 [1679866457][INFO]op->device:Processor -133 [1679866457][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866457][INFO]Set : threshold:90000, temperature:46050, cdev:13(Processor), curr_state:-133, max_state:0 [1679866461][INFO]op->device:Processor -134 [1679866461][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866461][INFO]Set : threshold:90000, temperature:44050, cdev:13(Processor), curr_state:-134, max_state:0 [1679866465][INFO]op->device:Processor -135 [1679866465][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866465][INFO]Set : threshold:90000, temperature:45050, cdev:13(Processor), curr_state:-135, max_state:0 [1679866469][INFO]op->device:Processor -136 [1679866469][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866469][INFO]Set : threshold:90000, temperature:46050, cdev:13(Processor), curr_state:-136, max_state:0 [1679866473][INFO]op->device:Processor -137 [1679866473][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866473][INFO]Set : threshold:90000, temperature:46050, cdev:13(Processor), curr_state:-137, max_state:0 [1679866477][INFO]op->device:Processor -138 [1679866477][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866477][INFO]Set : threshold:90000, temperature:45050, cdev:13(Processor), curr_state:-138, max_state:0 [1679866481][INFO]op->device:Processor -139 [1679866481][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866481][INFO]Set : threshold:90000, temperature:46050, cdev:13(Processor), curr_state:-139, max_state:0 [1679866485][INFO]op->device:Processor -140 [1679866485][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866485][INFO]Set : threshold:90000, temperature:46050, cdev:13(Processor), curr_state:-140, max_state:0 [1679866489][INFO]op->device:Processor -141 [1679866489][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866489][INFO]Set : threshold:90000, temperature:45050, cdev:13(Processor), curr_state:-141, max_state:0 [1679866493][INFO]op->device:Processor -142 [1679866493][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866493][INFO]Set : threshold:90000, temperature:45050, cdev:13(Processor), curr_state:-142, max_state:0 [1679866497][INFO]op->device:Processor -143 [1679866497][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866497][INFO]Set : threshold:90000, temperature:45050, cdev:13(Processor), curr_state:-143, max_state:0 [1679866501][INFO]op->device:Processor -144 [1679866501][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866501][INFO]Set : threshold:90000, temperature:46050, cdev:13(Processor), curr_state:-144, max_state:0 [1679866505][INFO]op->device:Processor -145 [1679866505][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866505][INFO]Set : threshold:90000, temperature:44050, cdev:13(Processor), curr_state:-145, max_state:0 [1679866509][INFO]op->device:Processor -146 [1679866509][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866509][INFO]Set : threshold:90000, temperature:44050, cdev:13(Processor), curr_state:-146, max_state:0 [1679866513][INFO]op->device:Processor -147 [1679866513][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866513][INFO]Set : threshold:90000, temperature:44050, cdev:13(Processor), curr_state:-147, max_state:0 [1679866517][INFO]op->device:Processor -148 [1679866517][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866517][INFO]Set : threshold:90000, temperature:45050, cdev:13(Processor), curr_state:-148, max_state:0 [1679866521][INFO]op->device:Processor -149 [1679866521][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866521][INFO]Set : threshold:90000, temperature:45050, cdev:13(Processor), curr_state:-149, max_state:0 [1679866525][INFO]op->device:Processor -150 [1679866525][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866525][INFO]Set : threshold:90000, temperature:44050, cdev:13(Processor), curr_state:-150, max_state:0 [1679866529][INFO]op->device:Processor -151 [1679866529][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866529][INFO]Set : threshold:90000, temperature:43050, cdev:13(Processor), curr_state:-151, max_state:0 [1679866533][INFO]op->device:Processor -152 [1679866533][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866533][INFO]Set : threshold:90000, temperature:43050, cdev:13(Processor), curr_state:-152, max_state:0 [1679866537][INFO]op->device:Processor -153 [1679866537][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866537][INFO]Set : threshold:90000, temperature:43050, cdev:13(Processor), curr_state:-153, max_state:0 [1679866541][INFO]op->device:Processor -154 [1679866541][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866541][INFO]Set : threshold:90000, temperature:43050, cdev:13(Processor), curr_state:-154, max_state:0 [1679866545][INFO]op->device:Processor -155 [1679866545][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866545][INFO]Set : threshold:90000, temperature:43050, cdev:13(Processor), curr_state:-155, max_state:0 [1679866549][INFO]op->device:Processor -156 [1679866549][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866549][INFO]Set : threshold:90000, temperature:44050, cdev:13(Processor), curr_state:-156, max_state:0 [1679866553][INFO]op->device:Processor -157 [1679866553][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679866553][INFO]Set : threshold:90000, temperature:44050, cdev:13(Processor), curr_state:-157, max_state:0 [1679895110][INFO]op->device:Processor -158 [1679895110][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679895110][INFO]Set : threshold:90000, temperature:24050, cdev:13(Processor), curr_state:-158, max_state:0 [1679895114][INFO]op->device:Processor -159 [1679895114][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679895114][INFO]Set : threshold:90000, temperature:65050, cdev:13(Processor), curr_state:-159, max_state:0 [1679895118][INFO]op->device:Processor -160 [1679895118][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679895118][INFO]Set : threshold:90000, temperature:87050, cdev:13(Processor), curr_state:-160, max_state:0 [1679895122][INFO]op->device:Processor -161 [1679895122][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679895122][INFO]Set : threshold:90000, temperature:39050, cdev:13(Processor), curr_state:-161, max_state:0 [1679895126][INFO]op->device:Processor -162 [1679895126][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679895126][INFO]Set : threshold:90000, temperature:35050, cdev:13(Processor), curr_state:-162, max_state:0 [1679895130][INFO]op->device:Processor -163 [1679895130][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679895130][INFO]Set : threshold:90000, temperature:36050, cdev:13(Processor), curr_state:-163, max_state:0 [1679895134][INFO]op->device:Processor -164 [1679895134][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679895134][INFO]Set : threshold:90000, temperature:36050, cdev:13(Processor), curr_state:-164, max_state:0 [1679895138][INFO]op->device:Processor -165 [1679895138][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679895138][INFO]Set : threshold:90000, temperature:35050, cdev:13(Processor), curr_state:-165, max_state:0 [1679895142][INFO]op->device:Processor -166 [1679895142][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679895142][INFO]Set : threshold:90000, temperature:34050, cdev:13(Processor), curr_state:-166, max_state:0 [1679895146][INFO]op->device:Processor -167 [1679895146][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679895146][INFO]Set : threshold:90000, temperature:84050, cdev:13(Processor), curr_state:-167, max_state:0 [1679895150][INFO]op->device:Processor -168 [1679895150][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679895150][INFO]Set : threshold:90000, temperature:81050, cdev:13(Processor), curr_state:-168, max_state:0 [1679895154][INFO]op->device:Processor -169 [1679895154][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679895154][INFO]Set : threshold:90000, temperature:89050, cdev:13(Processor), curr_state:-169, max_state:0 [1679895158][INFO]op->device:Processor -170 [1679895158][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679895158][INFO]Set : threshold:90000, temperature:58050, cdev:13(Processor), curr_state:-170, max_state:0 [1679895162][INFO]op->device:Processor -171 [1679895162][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679895162][INFO]Set : threshold:90000, temperature:45050, cdev:13(Processor), curr_state:-171, max_state:0 [1679895166][INFO]op->device:Processor -172 [1679895166][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679895166][INFO]Set : threshold:90000, temperature:42050, cdev:13(Processor), curr_state:-172, max_state:0 [1679895170][INFO]op->device:Processor -173 [1679895170][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state [1679895170][INFO]Set : threshold:90000, temperature:42050, cdev:13(Processor), curr_state:-173, max_state:0 ```

The interesting part in there is:

[1679866345][INFO]Added zone 2 trip 2 clamp_valid 0 clamp 0 _min:0 _max:0
[1679866345][INFO]set cdev state index 18 state 11712000 wr:11712000
[1679866345][INFO]Set : threshold:90000, temperature:90050, cdev:18(rapl_controller), curr_state:11712000, max_state:125000
$ ls -1 /sys/class/thermal/
cooling_device0@
cooling_device1@
cooling_device10@
cooling_device11@
cooling_device12@
cooling_device13@
cooling_device14@
cooling_device15@
cooling_device16@
cooling_device17@
cooling_device2@
cooling_device3@
cooling_device4@
cooling_device5@
cooling_device6@
cooling_device7@
cooling_device8@
cooling_device9@
thermal_zone0@
thermal_zone1@
thermal_zone2@
thermal_zone3@
thermal_zone4@
thermal_zone5@

$ cat /sys/class/thermal/cooling_device*/cur_state
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
-1
10
[pid  9479] openat(AT_FDCWD, "/sys/class/powercap/intel-rapl/intel-rapl:0/energy_uj", O_RDONLY) = 8
[pid  9479] read(8, "15514942154\n", 8191) = 12
[pid  9479] close(8)                    = 0
[pid  9479] openat(AT_FDCWD, "/sys/class/thermal/thermal_zone2/temp", O_RDONLY) = 8
[pid  9479] read(8, "37050\n", 8191)    = 6
[pid  9479] close(8)                    = 0
[pid  9479] newfstatat(AT_FDCWD, "/sys/class/thermal/cooling_device13/max_state", {st_mode=S_IFREG|0444, st_size=4096, ...}, 0) = 0
[pid  9479] openat(AT_FDCWD, "/sys/class/thermal/cooling_device13/max_state", O_RDONLY) = 8
[pid  9479] read(8, "0\n", 8191)        = 2
[pid  9479] close(8)                    = 0
[pid  9479] write(1, "[1679900493][INFO]op->device:Pro"..., 44) = 44
[pid  9479] newfstatat(AT_FDCWD, "/sys/class/thermal/cooling_device13/cur_state", {st_mode=S_IFREG|0644, st_size=4096, ...}, 0) = 0
[pid  9479] openat(AT_FDCWD, "/sys/class/thermal/cooling_device13/cur_state", O_WRONLY) = 8
[pid  9479] write(8, "-459", 4)         = -1 EINVAL (Invalid argument)
[pid  9479] write(1, "[1679900493][INFO]sysfs write fa"..., 83) = 83
[pid  9479] close(8)                    = 0
[pid  9479] write(1, "[1679900493][INFO]Set : threshol"..., 109) = 109
[pid  9479] newfstatat(AT_FDCWD, "/sys/class/thermal/cooling_device13/max_state", {st_mode=S_IFREG|0444, st_size=4096, ...}, 0) = 0
[pid  9479] openat(AT_FDCWD, "/sys/class/thermal/cooling_device13/max_state", O_RDONLY) = 8
[pid  9479] read(8, "0\n", 8191)        = 2
[pid  9479] close(8)                    = 0
[pid  9479] poll([{fd=6, events=POLLIN}], 1, 4000) = 0 (Timeout)
[39288.463998] ACPI Error: No handler for Region [XIN1] (00000000a644a137) [UserDefinedRegion] (20221020/evregion-130)
[39288.464010] ACPI Error: Region UserDefinedRegion (ID=143) has no handler (20221020/exfldio-261)
[39288.464013] ACPI Error: Aborting method \_SB.PC00.LPCB.LGEC.SEN1._TMP due to previous error (AE_NOT_EXIST) (20221020/psparse-529)
[39288.469019] ACPI Error: No handler for Region [XIN1] (00000000a644a137) [UserDefinedRegion] (20221020/evregion-130)
[39288.469024] ACPI Error: Region UserDefinedRegion (ID=143) has no handler (20221020/exfldio-261)
[39288.469027] ACPI Error: Aborting method \_SB.PC00.LPCB.LGEC.SEN1._TMP due to previous error (AE_NOT_EXIST) (20221020/psparse-529)
[39288.469032] thermal thermal_zone6: failed to read out thermal zone (-5)
[1679902221][INFO]Set : threshold:90000, temperature:53050, cdev:13(Processor), curr_state:-9, max_state:0
[1679902225][INFO]op->device:Processor -10
[1679902225][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state
[1679902225][INFO]Set : threshold:90000, temperature:54050, cdev:13(Processor), curr_state:-10, max_state:0
[1679902229][INFO]op->device:Processor -11
[1679902229][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state
[1679902229][INFO]Set : threshold:90000, temperature:53050, cdev:13(Processor), curr_state:-11, max_state:0
[1679902233][INFO]cdev index:18 consecutive call, increment exponentially state 6041000 (min 28000000 max 125000) (8041000:1)
[1679902233][INFO]set cdev state index 18 state 6041000 wr:6041000
[1679902233][INFO]Set : threshold:90000, temperature:90050, cdev:18(rapl_controller), curr_state:6041000, max_state:125000
[1679902237][INFO]op->device:Processor -12
[snip]
[1679903687][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state
[1679903687][INFO]Set : threshold:90000, temperature:47050, cdev:13(Processor), curr_state:-374, max_state:0
[1679903691][INFO]op->device:Processor -375
[1679903691][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state
[1679903691][INFO]Set : threshold:90000, temperature:45050, cdev:13(Processor), curr_state:-375, max_state:0
[1679903695][INFO]cdev index:18 consecutive call, increment exponentially state 4041000 (min 28000000 max 125000) (8041000:2)
[1679903695][INFO]set cdev state index 18 state 4041000 wr:4041000
[1679903695][INFO]Set : threshold:90000, temperature:90050, cdev:18(rapl_controller), curr_state:4041000, max_state:125000
[1679903699][INFO]op->device:Processor -376
[1679903699][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state
[1679903699][INFO]Set : threshold:90000, temperature:46050, cdev:13(Processor), curr_state:-376, max_state:0
[1679903703][INFO]op->device:Processor -377
[1679903703][INFO]sysfs write failed /sys/class/thermal/cooling_device13/cur_state
spandruvada commented 1 year ago

There is a missing implementation in LG Gram, where it needs help from LG engineers to implement a kernel driver to talk to firmware via ACPI op region. Without that I suggest don't run thermald on this system. It will result in too many errors. That's you have to disable INT3403.

Regarding the cooling device state, I think there was a kernel regression. Please try the latest upstream kernel. max_state can't be 0 and also curr_state can't be -1 at start.

lnicola commented 1 year ago

There is a missing implementation in LG Gram, where it needs help from LG engineers to implement a kernel driver to talk to firmware via ACPI op region. Without that I suggest don't run thermald on this system. It will result in too many errors. That's you have to disable INT3403.

Would it be possible to make thermald not fail catastrophically when the ACPI info is missing?

Regarding the cooling device state, I think there was a kernel regression. Please try the latest upstream kernel. max_state can't be 0 and also curr_state can't be -1 at start.

I can't easily compile a kernel right now, but it appears that both of those are 0 at boot, on the latest Fedora kernel (6.2.9-300.fc38.x86_64). thermald tries to decrease cur_state by 1 every couple of seconds.

scr4bble commented 5 months ago

I have been experiencing the same issue in past days/weeks. Temperatures are around 50-60 degrees Celsius, cpu frequency throttled down to 800 or 400 despite CPU utilization being 80-100%. Stopping/disabling/removing thermald (hack)-fixes the issue. Laptop: Lenovo T480. thermald --version: 2.5.6

I haven't had these thermal-throttling issues for years already and now they came back (with some update?). It started at some point in the past few weeks while being on Fedora 38. I have upgraded to Fedora 39 but the issue persisted. Now temporarily solved by removing thermald service. I am willing to provide more details for debugging if you want to look into it.

# systemctl status thermald.service 
● thermald.service - Thermal Daemon Service
     Loaded: loaded (/usr/lib/systemd/system/thermald.service; enabled; preset: enabled)
    Drop-In: /usr/lib/systemd/system/service.d
             └─10-timeout-abort.conf
     Active: active (running) since Fri 2024-02-23 22:56:58 CET; 9min ago
   Main PID: 9325 (thermald)
      Tasks: 5 (limit: 38274)
     Memory: 1.3M
     CGroup: /system.slice/thermald.service
             └─9325 /usr/sbin/thermald --systemd --dbus-enable --adaptive

Feb 23 22:56:57 lenovo-t480 systemd[1]: Starting thermald.service - Thermal Daemon Service...
Feb 23 22:56:57 lenovo-t480 thermald[9325]: 22 CPUID levels; family:model:stepping 0x6:8e:a (6:142:10)
Feb 23 22:56:58 lenovo-t480 thermald[9325]: 22 CPUID levels; family:model:stepping 0x6:8e:a (6:142:10)
Feb 23 22:56:58 lenovo-t480 thermald[9325]: sensor id 12 : No temp sysfs for reading raw temp
Feb 23 22:56:58 lenovo-t480 thermald[9325]: sensor id 12 : No temp sysfs for reading raw temp
Feb 23 22:56:58 lenovo-t480 thermald[9325]: sensor id 12 : No temp sysfs for reading raw temp
Feb 23 22:56:58 lenovo-t480 thermald[9325]: Polling mode is enabled: 4
Feb 23 22:56:58 lenovo-t480 systemd[1]: Started thermald.service - Thermal Daemon Service.
ejgallego commented 2 months ago

Hi folks, having similar issues to everyone on a Dell precision 5550 with Ubuntu 22.04, (kernel 6.9.3, thermald 2.5.6, Dell firmware 1.29.0).

Removing the int3403_thermal module restores performance back to regular levels.

So whatever is going on, still a problem. It's a pity, because when the kernel module is removed, the performance of the laptop is excellent (45W for hours is IMHO reasonable and likely what you get on windows)

@spandruvada let me know if you would like to see a thermald log.

spandruvada commented 1 month ago

You can run https://github.com/intel/thermal_daemon/blob/master/test/thermal-debug-dump-ubuntu.sh and send the tar file it generates.

spandruvada commented 1 month ago

Please always open new issues for the last 2 issues. This issue is about an MSI/LG Gram issue. If you add to old issues, I will miss them.

jy-lefort commented 1 month ago

I now use a Dell XPS 15 9530 (Core i7 13th gen) and thermald now seems to work as expected.

Thanks for your work.

ejgallego commented 1 month ago

Thanks @spandruvada ! I will do !