intel / thermal_daemon

Thermal daemon for IA
GNU General Public License v2.0

Haswell, thermald 1.5.3-1 ARCH - TRT/ART failed, sysfs read/write failed #85

Closed AM2petterk closed 7 years ago

AM2petterk commented 8 years ago

Arch x64

Linux-ck 4.3.6

Mar 07 12:27:22 petterk-linux systemd[1]: Starting Thermal Daemon Service...
Mar 07 12:27:22 petterk-linux systemd[1]: Started Thermal Daemon Service.
Mar 07 12:27:22 petterk-linux thermald[622]: 13 CPUID levels; family:model:stepping 0x6:45:1 (6:69:1)
Mar 07 12:27:22 petterk-linux thermald[622]: Polling mode is enabled: 4
Mar 07 12:27:22 petterk-linux thermald[622]:  failed to GET COUNT on /dev/acpi_thermal_rel
Mar 07 12:27:22 petterk-linux thermald[622]:  failed to GET COUNT on /dev/acpi_thermal_rel
Mar 07 12:27:22 petterk-linux thermald[622]: TRT/ART read failed
Mar 07 12:27:22 petterk-linux thermald[622]: sysfs read failed constraint_0_max_power_uw
Mar 07 12:27:22 petterk-linux thermald[622]: sysfs write failed trip_point_0_temp
spandruvada commented 8 years ago

On Mon, 2016-03-07 at 03:27 -0800, Petter K. wrote:

> Arch x64 Linux-ck 4.3.6
> Mar 07 12:27:22 petterk-linux systemd[1]: Starting Thermal Daemon Service...
> Mar 07 12:27:22 petterk-linux systemd[1]: Started Thermal Daemon Service.
> Mar 07 12:27:22 petterk-linux thermald[622]: 13 CPUID levels; family:model:stepping 0x6:45:1 (6:69:1)
> Mar 07 12:27:22 petterk-linux thermald[622]: Polling mode is enabled: 4
> Mar 07 12:27:22 petterk-linux thermald[622]:  failed to GET COUNT on /dev/acpi_thermal_rel
> Mar 07 12:27:22 petterk-linux thermald[622]:  failed to GET COUNT on /dev/acpi_thermal_rel
> Mar 07 12:27:22 petterk-linux thermald[622]: TRT/ART read failed

This is not a problem. Not every system will have all thermal tables.

> Mar 07 12:27:22 petterk-linux thermald[622]: sysfs read failed constraint_0_max_power_uw
> Mar 07 12:27:22 petterk-linux thermald[622]: sysfs write failed trip_point_0_temp


mdolezel commented 8 years ago

I am running Ubuntu 15.10 on a Lenovo ThinkPad T440p with an i7 2600M and am experiencing some overheating issues. Graph here (idle, Ubuntu 15.10, no GUI running): https://docs.google.com/spreadsheets/d/1-m4H1od3t9SF053wnC2aSKlQR1qu4UBFPdcL5ie0Pes/pubhtml

spandruvada commented 8 years ago

On Wed, 2016-04-13 at 03:01 -0700, Marek Doležel wrote:

> I am running Ubuntu 15.10 on a Lenovo ThinkPad T440p with an i7 2600M and am experiencing some overheating issues. Graph here (idle, Ubuntu 15.10, no GUI running): https://docs.google.com/spreadsheets/d/1-m4H1od3t9SF053wnC2aSKlQR1qu4UBFPdcL5ie0Pes/edit#gid=0

Can you send it to me by email? I can't access your document.

Thanks, Srinivas


mdolezel commented 8 years ago

@spandruvada I have edited the link, though I realize my issue may not be relevant to this one. https://docs.google.com/spreadsheets/d/1-m4H1od3t9SF053wnC2aSKlQR1qu4UBFPdcL5ie0Pes/pubhtml ... I am unable to find your email address; mine is marekdolezel at gmail dot com. Please let me know if I can provide any further information.

sheepdestroyer commented 8 years ago

On a Lenovo X220 with an Intel Sandy Bridge Core i7 2620M, kernel 4.6-RC4 built with Intel Powerclamp enabled, running the intel_pstate driver and thermald 1.5.3.

I get a similar status and thermald seems to do nothing. How can I make sure thermald is actually doing anything? There is nothing in the logs even as the temperature goes crazy.

$ sudo systemctl status thermald.service 
● thermald.service - Thermal Daemon Service
   Loaded: loaded (/usr/lib/systemd/system/thermald.service; disabled; vendor preset: disabled)
   Active: active (running) since Tue 2016-04-19 00:39:58 JST; 3min 58s ago
 Main PID: 15205 (thermald)
   CGroup: /system.slice/thermald.service
           └─15205 /usr/sbin/thermald --no-daemon --dbus-enable

Apr 19 00:39:58 sheepora thermald[15205]: Polling mode is enabled: 4
Apr 19 00:39:58 sheepora thermald[15205]: failed to open /dev/acpi_thermal_rel
Apr 19 00:39:58 sheepora thermald[15205]: failed to open /dev/acpi_thermal_rel
Apr 19 00:39:58 sheepora thermald[15205]: TRT/ART read failed
Apr 19 00:39:58 sheepora thermald[15205]: sysfs write failed enabled
Apr 19 00:39:58 sheepora systemd[1]: Started Thermal Daemon Service.
Apr 19 00:39:58 sheepora thermald[15205]: sysfs read failed max_brightness
Apr 19 00:39:58 sheepora thermald[15205]: sysfs write failed trip_point_0_temp
Apr 19 00:40:46 sheepora thermald[15205]: sysfs write failed enabled
Apr 19 00:40:46 sheepora thermald[15205]: sysfs write failed constraint_0_power_limit_uw
[sheepdestroyer@sheepora tools] $ cat /etc/thermald/thermal-conf.xml 
<?xml version="1.0"?>
<!--
use "man thermal-conf.xml" for details
-->
<!-- BEGIN -->
<ThermalConfiguration>
    <Platform>
        <Name>Lenovo ThinkPad X220</Name>
        <ProductName>*</ProductName>
        <Preference>QUIET</Preference>
        <ThermalZones>
            <ThermalZone>
                <Type>x86_pkg_temp</Type>
                <TripPoints>
                    <TripPoint>
                        <SensorType>x86_pkg_temp</SensorType>
                        <Temperature>60000</Temperature>
                        <type>passive</type>
                        <ControlType>SEQUENTIAL</ControlType>
                        <CoolingDevice>
                            <index>1</index>
                            <type>Fan_1</type>
                            <influence> 100 </influence>
                            <SamplingPeriod> 20 </SamplingPeriod>
                        </CoolingDevice>
                    </TripPoint>
                    <TripPoint>
                        <SensorType>x86_pkg_temp</SensorType>
                        <Temperature>70000</Temperature>
                        <type>passive</type>
                        <ControlType>SEQUENTIAL</ControlType>
                        <CoolingDevice>
                            <index>1</index>
                            <type>Fan_2</type>
                            <influence> 100 </influence>
                            <SamplingPeriod> 12 </SamplingPeriod>
                        </CoolingDevice>
                    </TripPoint>
                    <TripPoint>
                        <SensorType>x86_pkg_temp</SensorType>
                        <Temperature>80000</Temperature>
                        <type>passive</type>
                        <ControlType>SEQUENTIAL</ControlType>
                        <CoolingDevice>
                            <index>1</index>
                            <type>Fan_3</type>
                            <influence> 100 </influence>
                            <SamplingPeriod> 12 </SamplingPeriod>
                        </CoolingDevice>
                    </TripPoint>
                </TripPoints>
            </ThermalZone>
        </ThermalZones>
        <CoolingDevices>
            <CoolingDevice>
                <Type>Fan_1</Type>
                <Path>/sys/devices/platform/thinkpad_hwmon/pwm1</Path>
                <MinState>37</MinState>
                <IncDecStep>37</IncDecStep>
                <ReadBack> 0 </ReadBack>
                <MaxState>112</MaxState>
                <DebouncePeriod>5</DebouncePeriod>
            </CoolingDevice>
            <CoolingDevice>
                <Type>Fan_2</Type>
                <Path>/sys/devices/platform/thinkpad_hwmon/pwm1</Path>
                <MinState>112</MinState>
                <IncDecStep>37</IncDecStep>
                <ReadBack> 0 </ReadBack>
                <MaxState>149</MaxState>
                <DebouncePeriod>5</DebouncePeriod>
            </CoolingDevice>
            <CoolingDevice>
                <Type>Fan_3</Type>
                <Path>/sys/devices/platform/thinkpad_hwmon/pwm1</Path>
                <MinState>149</MinState>
                <IncDecStep>37</IncDecStep>
                <ReadBack> 0 </ReadBack>
                <MaxState>223</MaxState>
                <DebouncePeriod>5</DebouncePeriod>
            </CoolingDevice>
        </CoolingDevices>
    </Platform>
</ThermalConfiguration>
<!-- END -->
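
A config like the one above can be sanity-checked before handing it to thermald. The following is a minimal sketch, not part of thermald: the function name `summarize_thermal_conf` is hypothetical, and it assumes every trip point and cooling device carries the same tags used in this file. It lists each trip point in degrees Celsius (the `Temperature` values are in millidegrees) and reports any trip point whose cooling device `type` has no matching definition under `CoolingDevices`.

```python
import xml.etree.ElementTree as ET


def summarize_thermal_conf(xml_text):
    """Return (trip_points, cooling_devices, undefined_types) for a config string.

    Hypothetical helper for illustration; assumes the tag layout shown in
    the thread's thermal-conf.xml.
    """
    root = ET.fromstring(xml_text)

    # Cooling device definitions live under Platform/CoolingDevices.
    devices = {}
    for dev in root.findall("./Platform/CoolingDevices/CoolingDevice"):
        name = dev.findtext("Type").strip()
        devices[name] = {
            "path": dev.findtext("Path").strip(),
            "min": int(dev.findtext("MinState")),
            "max": int(dev.findtext("MaxState")),
        }

    # Each trip point names the cooling device it activates.
    trips = []
    for trip in root.iter("TripPoint"):
        temp_millideg = int(trip.findtext("Temperature"))
        dev_type = trip.findtext("./CoolingDevice/type").strip()
        trips.append((temp_millideg / 1000.0, dev_type))

    # Trip points that reference a device type with no definition.
    undefined = sorted({t for _, t in trips if t not in devices})
    return trips, devices, undefined
```

Reading `/etc/thermald/thermal-conf.xml` and passing its contents to this function should, for the file above, list trip points at 60, 70 and 80 degrees, all driving the same `pwm1` path with different state ranges.
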
spandruvada commented 8 years ago

On Mon, 2016-04-18 at 08:57 -0700, sheepdestroyer wrote:

> On a Lenovo X220 with an Intel Sandy Bridge Core i7 2620M, kernel 4.6-RC4 built with Intel Powerclamp enabled, running the intel_pstate driver and thermald 1.5.3.
> I get a similar status and thermald seems to do nothing. How can I make sure thermald is actually doing anything? There is nothing in the logs even as the temperature goes crazy.
> $ sudo systemctl status thermald.service

Stop the service first:

sudo systemctl stop thermald

Then run thermald from the command line:

sudo thermald --no-daemon --loglevel=info

or, for more detail (this produces a lot of logging):

sudo thermald --no-daemon --loglevel=debug

Also, you should verify that fan control works on your system. Your config specifies /sys/devices/platform/thinkpad_hwmon/pwm1, so try this as root:

echo 100 > /sys/devices/platform/thinkpad_hwmon/pwm1

Does this command change the fan speed? You should be able to hear the fan get louder.
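
The same check can be scripted. This is only a sketch under assumptions: `write_and_read_back` is a hypothetical helper, it must run as root against the real sysfs path, and it assumes the standard hwmon PWM range of 0 to 255. It writes a value and returns what the file reports afterwards; a `PermissionError` or other `OSError` would mean the kernel refused the write.

```python
from pathlib import Path


def write_and_read_back(path, value):
    """Write a PWM value to a sysfs-style file and return the value read back.

    Hypothetical helper for illustration; raises ValueError outside 0..255
    and lets OSError propagate if the write is refused.
    """
    if not 0 <= value <= 255:
        raise ValueError(f"PWM value out of range: {value}")
    node = Path(path)
    node.write_text(f"{value}\n")
    return int(node.read_text().strip())
```

For example, `write_and_read_back("/sys/devices/platform/thinkpad_hwmon/pwm1", 100)` mirrors the `echo` command above. Hearing the fan change remains the real test, since a successful write alone does not prove the fan responded.
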

The failure messages below are fine: your system lacks some of the ACPI thermal tables, which would otherwise have been used to generate the XML config file automatically.

> ● thermald.service - Thermal Daemon Service
>    Loaded: loaded (/usr/lib/systemd/system/thermald.service; disabled; vendor preset: disabled)
>    Active: active (running) since Tue 2016-04-19 00:39:58 JST; 3min 58s ago
>  Main PID: 15205 (thermald)
>    CGroup: /system.slice/thermald.service
>            └─15205 /usr/sbin/thermald --no-daemon --dbus-enable
>
> Apr 19 00:39:58 sheepora thermald[15205]: Polling mode is enabled: 4
> Apr 19 00:39:58 sheepora thermald[15205]: failed to open /dev/acpi_thermal_rel
> Apr 19 00:39:58 sheepora thermald[15205]: failed to open /dev/acpi_thermal_rel
> Apr 19 00:39:58 sheepora thermald[15205]: TRT/ART read failed
> Apr 19 00:39:58 sheepora thermald[15205]: sysfs write failed enabled
> Apr 19 00:39:58 sheepora systemd[1]: Started Thermal Daemon Service.
> Apr 19 00:39:58 sheepora thermald[15205]: sysfs read failed max_brightness
> Apr 19 00:39:58 sheepora thermald[15205]: sysfs write failed trip_point_0_temp
> Apr 19 00:40:46 sheepora thermald[15205]: sysfs write failed enabled
> Apr 19 00:40:46 sheepora thermald[15205]: sysfs write failed constraint_0_power_limit_uw
> [sheepdestroyer@sheepora tools] $ cat /etc/thermald/thermal-conf.xml
> <?xml version="1.0"?>
> [...]
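
When reading logs like these, it can help to separate the failures attributed in this thread to missing ACPI thermal tables from the remaining ones. The sketch below is an illustration only: `triage` and the `TABLE_RELATED` markers are hypothetical names, and the markers are taken from the log lines quoted here, not from thermald's source.

```python
import re

# Substrings of the messages this thread ties to missing ACPI thermal tables.
TABLE_RELATED = ("/dev/acpi_thermal_rel", "TRT/ART read failed")


def triage(lines):
    """Split thermald failure messages into (table_related, other).

    Hypothetical helper for illustration; lines from other units and
    thermald lines without "failed" are ignored.
    """
    table_related, other = [], []
    for line in lines:
        # Keep only the message that follows "thermald[PID]:".
        match = re.search(r"thermald\[\d+\]:\s*(.*)", line)
        if not match:
            continue
        message = match.group(1)
        if "failed" not in message:
            continue
        if any(marker in message for marker in TABLE_RELATED):
            table_related.append(message)
        else:
            other.append(message)
    return table_related, other
```

Feeding it the journal lines above would put the `acpi_thermal_rel` and `TRT/ART` messages in the first list and the `sysfs read failed` and `sysfs write failed` messages in the second.
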
