s-light / ASUS-ZenBook-Pro-Duo-UX581GV

ASUS ZenBook Pro Duo UX581GV - infos & helper scripts for running Kubuntu
MIT License
68 stars 8 forks source link

Fan Control #6

Open DemonicTutor opened 4 years ago

DemonicTutor commented 4 years ago

Hey!

Did you have any luck figuring out how to control the fans yet? I'm getting a lot of warnings regarding CPU throttling and i would love to avoid that:

mce: CPU4: Core temperature above threshold, cpu clock throttled (total events = 1655)

RobinVanCauter commented 4 years ago

Not sure if I have experienced the throttling myself yet up until now. I do get about 10 of these messages during boot, but haven't seen them afterwards. But I haven't checked after heavier workloads yet, because I never really had the feeling I was being throttled except when working on battery.

You could give the following a try:

https://wiki.archlinux.org/index.php/Fan_speed_control#NBFC

DemonicTutor commented 4 years ago

just having it standing around and already up to >12k events.

sensors doesn't display fans so i guess just NBFC will not do the trick.. trying sensors-detect right now.

RobinVanCauter commented 4 years ago

Just checked the result after having it just sitting there for a while, but on the lock screen and the events do seem to increase over time. After about 4 hours of lock screen it hit 175 total throttling events for me. Kind of weird, since it's not really doing anything and it's not like it feels hot.

Will check if there's any difference during heavy workloads (I expect there will be even more events)

DemonicTutor commented 4 years ago

yeah because it doesnt spin up the fans... run stress for example

s-light commented 4 years ago

hmpf - i also get this messages...

[ 1685.019810] mce: CPU14: Package temperature above threshold, cpu clock throttled (total events = 1480)
[ 1685.019810] mce: CPU6: Package temperature above threshold, cpu clock throttled (total events = 1480)
[ 1685.019857] mce: CPU9: Package temperature above threshold, cpu clock throttled (total events = 1480)
[ 1685.019858] mce: CPU1: Package temperature above threshold, cpu clock throttled (total events = 1480)
[ 1685.019859] mce: CPU8: Package temperature above threshold, cpu clock throttled (total events = 1480)
[ 1685.019860] mce: CPU0: Package temperature above threshold, cpu clock throttled (total events = 1480)
[ 1685.019861] mce: CPU5: Package temperature above threshold, cpu clock throttled (total events = 1480)
[ 1685.019862] mce: CPU4: Package temperature above threshold, cpu clock throttled (total events = 1480)
[ 1685.019864] mce: CPU11: Package temperature above threshold, cpu clock throttled (total events = 1480)
[ 1685.019864] mce: CPU12: Package temperature above threshold, cpu clock throttled (total events = 1480)
[ 1685.019865] mce: CPU13: Package temperature above threshold, cpu clock throttled (total events = 1480)
[ 1685.019866] mce: CPU3: Package temperature above threshold, cpu clock throttled (total events = 1480)
[ 1685.019867] mce: CPU2: Package temperature above threshold, cpu clock throttled (total events = 1480)
[ 1685.019868] mce: CPU10: Package temperature above threshold, cpu clock throttled (total events = 1480)
[ 1685.019870] mce: CPU7: Package temperature above threshold, cpu clock throttled (total events = 1480)
[ 1685.019871] mce: CPU15: Package temperature above threshold, cpu clock throttled (total events = 1480)
[ 1685.020789] mce: CPU7: Package temperature/speed normal
[ 1685.020790] mce: CPU14: Package temperature/speed normal
[ 1685.020791] mce: CPU11: Package temperature/speed normal
[ 1685.020791] mce: CPU15: Package temperature/speed normal
[ 1685.020792] mce: CPU6: Package temperature/speed normal
[ 1685.020793] mce: CPU3: Package temperature/speed normal
[ 1685.020835] mce: CPU9: Package temperature/speed normal
[ 1685.020835] mce: CPU1: Package temperature/speed normal
[ 1685.020836] mce: CPU8: Package temperature/speed normal
[ 1685.020837] mce: CPU0: Package temperature/speed normal
[ 1685.020838] mce: CPU2: Package temperature/speed normal
[ 1685.020838] mce: CPU10: Package temperature/speed normal
[ 1685.020839] mce: CPU12: Package temperature/speed normal
[ 1685.020840] mce: CPU13: Package temperature/speed normal
[ 1685.020841] mce: CPU4: Package temperature/speed normal
[ 1685.020841] mce: CPU5: Package temperature/speed normal
[ 1777.338886] mce: CPU8: Core temperature above threshold, cpu clock throttled (total events = 48)
[ 1777.338886] mce: CPU0: Core temperature above threshold, cpu clock throttled (total events = 48)
[ 1777.339885] mce: CPU8: Core temperature/speed normal
[ 1777.339886] mce: CPU0: Core temperature/speed normal
[ 1996.510023] mce: CPU7: Package temperature above threshold, cpu clock throttled (total events = 1541)
[ 1996.510024] mce: CPU15: Package temperature above threshold, cpu clock throttled (total events = 1541)
[ 1996.510071] mce: CPU9: Package temperature above threshold, cpu clock throttled (total events = 1541)
[ 1996.510072] mce: CPU8: Package temperature above threshold, cpu clock throttled (total events = 1541)
[ 1996.510073] mce: CPU0: Package temperature above threshold, cpu clock throttled (total events = 1541)
[ 1996.510074] mce: CPU1: Package temperature above threshold, cpu clock throttled (total events = 1541)
[ 1996.510075] mce: CPU10: Package temperature above threshold, cpu clock throttled (total events = 1541)
[ 1996.510076] mce: CPU2: Package temperature above threshold, cpu clock throttled (total events = 1541)
[ 1996.510077] mce: CPU5: Package temperature above threshold, cpu clock throttled (total events = 1541)
[ 1996.510078] mce: CPU13: Package temperature above threshold, cpu clock throttled (total events = 1541)
[ 1996.510079] mce: CPU3: Package temperature above threshold, cpu clock throttled (total events = 1541)
[ 1996.510080] mce: CPU14: Package temperature above threshold, cpu clock throttled (total events = 1541)
[ 1996.510081] mce: CPU11: Package temperature above threshold, cpu clock throttled (total events = 1541)
[ 1996.510081] mce: CPU6: Package temperature above threshold, cpu clock throttled (total events = 1541)
[ 1996.510083] mce: CPU4: Package temperature above threshold, cpu clock throttled (total events = 1541)
[ 1996.510084] mce: CPU12: Package temperature above threshold, cpu clock throttled (total events = 1541)
[ 1996.511031] mce: CPU15: Package temperature/speed normal
[ 1996.511031] mce: CPU7: Package temperature/speed normal
[ 1996.511032] mce: CPU8: Package temperature/speed normal
[ 1996.511032] mce: CPU0: Package temperature/speed normal
[ 1996.511033] mce: CPU2: Package temperature/speed normal
[ 1996.511034] mce: CPU10: Package temperature/speed normal
[ 1996.511075] mce: CPU1: Package temperature/speed normal
[ 1996.511076] mce: CPU9: Package temperature/speed normal
[ 1996.511077] mce: CPU3: Package temperature/speed normal
[ 1996.511077] mce: CPU11: Package temperature/speed normal
[ 1996.511078] mce: CPU4: Package temperature/speed normal
[ 1996.511079] mce: CPU12: Package temperature/speed normal
[ 1996.511080] mce: CPU6: Package temperature/speed normal
[ 1996.511081] mce: CPU5: Package temperature/speed normal
[ 1996.511081] mce: CPU14: Package temperature/speed normal
[ 1996.511082] mce: CPU13: Package temperature/speed normal
[ 2308.275287] mce: CPU12: Core temperature above threshold, cpu clock throttled (total events = 1184)
[ 2308.275288] mce: CPU4: Core temperature above threshold, cpu clock throttled (total events = 1184)
[ 2308.275289] mce: CPU3: Package temperature above threshold, cpu clock throttled (total events = 1867)
[ 2308.275290] mce: CPU11: Package temperature above threshold, cpu clock throttled (total events = 1867)
[ 2308.275291] mce: CPU4: Package temperature above threshold, cpu clock throttled (total events = 1867)
[ 2308.275294] mce: CPU12: Package temperature above threshold, cpu clock throttled (total events = 1867)
[ 2308.275380] mce: CPU10: Package temperature above threshold, cpu clock throttled (total events = 1867)
[ 2308.275381] mce: CPU2: Package temperature above threshold, cpu clock throttled (total events = 1867)
[ 2308.275418] mce: CPU9: Package temperature above threshold, cpu clock throttled (total events = 1867)
[ 2308.275419] mce: CPU1: Package temperature above threshold, cpu clock throttled (total events = 1867)
[ 2308.275420] mce: CPU0: Package temperature above threshold, cpu clock throttled (total events = 1867)
[ 2308.275421] mce: CPU8: Package temperature above threshold, cpu clock throttled (total events = 1867)
[ 2308.275423] mce: CPU13: Package temperature above threshold, cpu clock throttled (total events = 1867)
[ 2308.275424] mce: CPU5: Package temperature above threshold, cpu clock throttled (total events = 1867)
[ 2308.275425] mce: CPU15: Package temperature above threshold, cpu clock throttled (total events = 1867)
[ 2308.275426] mce: CPU14: Package temperature above threshold, cpu clock throttled (total events = 1867)
[ 2308.275427] mce: CPU7: Package temperature above threshold, cpu clock throttled (total events = 1867)
[ 2308.275428] mce: CPU6: Package temperature above threshold, cpu clock throttled (total events = 1867)
[ 2308.276323] mce: CPU12: Core temperature/speed normal
[ 2308.276324] mce: CPU3: Package temperature/speed normal
[ 2308.276324] mce: CPU9: Package temperature/speed normal
[ 2308.276325] mce: CPU5: Package temperature/speed normal
[ 2308.276326] mce: CPU4: Core temperature/speed normal
[ 2308.276326] mce: CPU1: Package temperature/speed normal
[ 2308.276327] mce: CPU13: Package temperature/speed normal
[ 2308.276327] mce: CPU11: Package temperature/speed normal
[ 2308.276356] mce: CPU4: Package temperature/speed normal
[ 2308.276357] mce: CPU12: Package temperature/speed normal
[ 2308.276360] mce: CPU0: Package temperature/speed normal
[ 2308.276360] mce: CPU8: Package temperature/speed normal
[ 2308.276361] mce: CPU10: Package temperature/speed normal
[ 2308.276362] mce: CPU2: Package temperature/speed normal
[ 2308.276363] mce: CPU14: Package temperature/speed normal
[ 2308.276364] mce: CPU6: Package temperature/speed normal
[ 2308.276365] mce: CPU15: Package temperature/speed normal
[ 2308.276365] mce: CPU7: Package temperature/speed normal

but as you can tell from the time-stamps first it shows the [ 2308.275287] mce: CPU12: Core temperature above threshold, cpu clock throttled (total events = 1184) then [ 2308.276357] mce: CPU12: Package temperature/speed normal there are only ~1ms time difference.. and i think i have somewhere read that this is just a bug / wrong configuration.. (i don't know where this was - it showed up on my search what the message means)

DemonicTutor commented 4 years ago

do your fans spin up if you run stress -c 12 --timeout 10 ? my temps get up to 90, the fans do nothing and ofcourse the CPU gets throttled down because it would melt otherwise.

i had no time the past days to toy around... currently ArchLinux is installed and the fans didnt spin up either.. but i didnt toy around with kernel-modules or ACPI stuff yet...

RobinVanCauter commented 4 years ago

My fans spin up when I put any load on the CPU. Even just watching Youtube makes them spin quite loud. I do get the messages, but I never get such high temps as you are reporting, since my fans kick in whenever it starts to get hot.

Just out of the box behavior on my installation, so you should probably find out what PopOS is including by default that ArchLinux doesn't :D

If I can be of any assistance, let me know.

DemonicTutor commented 4 years ago

good to know! I have never before even heard of PopOS .. but i could give it a try tomorrow to see whats going on...

If you type sensors does it list tempratures and fan-RPM or just temps?

Maybe you can list you kernel-version and loaded modules? Can you check what you have installed regarding Power / Fan control ?

modules / packages like asus-nb-wmi asus-fan coretemp nbfc ?

content of /etc/fancontrol if you have that?

where are your fans under /sys/devices/platform ?

how does your grub config look like ? maybe anything regarding ACPI going on there?

Update: Well thats disappointing... i cant get Pop Os! to boot ... from my USB-Stick :/

DemonicTutor commented 4 years ago

Reddit - Linux on asus zenbook pro duo ux581gv

Nerdtome reports that the fans work for him using Ubuntu Budgie

s-light commented 4 years ago

for me the fan control is working with Kubuntu 19.10. if i do some cpu or gpu intensive tasks the fans start spinning up.. if i just serve the web or do nothing they are super quite..

i have Psensor running - it is reporting a bunch of values: image sensors is also reporting things:

$ sensors
coretemp-isa-0000
Adapter: ISA adapter
Package id 0:  +48.0°C  (high = +100.0°C, crit = +100.0°C)
Core 0:        +46.0°C  (high = +100.0°C, crit = +100.0°C)
Core 1:        +48.0°C  (high = +100.0°C, crit = +100.0°C)
Core 2:        +46.0°C  (high = +100.0°C, crit = +100.0°C)
Core 3:        +45.0°C  (high = +100.0°C, crit = +100.0°C)
Core 4:        +46.0°C  (high = +100.0°C, crit = +100.0°C)
Core 5:        +47.0°C  (high = +100.0°C, crit = +100.0°C)
Core 6:        +46.0°C  (high = +100.0°C, crit = +100.0°C)
Core 7:        +48.0°C  (high = +100.0°C, crit = +100.0°C)

BAT0-acpi-0
Adapter: ACPI interface
in0:         +15.85 V  

iwlwifi-virtual-0
Adapter: Virtual device
temp1:        +36.0°C  

pch_cannonlake-virtual-0
Adapter: Virtual device
temp1:        +44.0°C  

acpitz-acpi-0
Adapter: ACPI interface
temp1:        +46.0°C  (crit = +103.0°C)
RobinVanCauter commented 4 years ago

@DemonicTutor I had the same issue with PopOS not booting from USB. There is something wrong with the bundled grub inside the installer ISO's for some systems.

https://github.com/pop-os/pop/issues/730

I myself booted into a grub console of a second USB containing a regular Ubuntu installer, used a couple of commands to replace the ubuntu grub config with the grub config of the PopOS disk and was able to boot into the installer from there. Once installed there is no issues with booting at all.

This method was described here: https://www.reddit.com/r/pop_os/comments/c9rsv7/any_experience_installing_pop_os_on_asus_zenbook/f7o6kqg/?utm_source=share&utm_medium=web2x

aidankmcl commented 4 years ago

Hey all! I had a very similar problem and tested on Ubuntu as well as Pop! OS (which I also was unable to boot until I tried the solution commented by @T-Grave ) and was still getting high temperatures as well as occasionally shutting down (I assume from sustaining high temperature but not confirmed). I could hear the fans running but not speeding up despite increasing temperatures.

Today I installed https://github.com/dominiksalvet/asus-fan-control on Pop and I can hear the fans speeding up! I'll update here again if this isn't a successful long term fix but I'm optimistic :slightly_smiling_face:

Agrover112 commented 3 years ago

I installed Ubuntu 20.04 and no wifi receiving lol

s-light commented 3 years ago

@Agrover112 pleas keep the issues to one single topic. so it is easier to discuss and resolve them... i created a new one for your topic: #11

s-light commented 3 years ago

has anybody made any progress?

i just tried with sensors on my kubuntu 20.04:

$ sensors
iwlwifi_1-virtual-0
Adapter: Virtual device
temp1:        +33.0°C  

coretemp-isa-0000
Adapter: ISA adapter
Package id 0:  +41.0°C  (high = +100.0°C, crit = +100.0°C)
Core 0:        +41.0°C  (high = +100.0°C, crit = +100.0°C)
Core 1:        +41.0°C  (high = +100.0°C, crit = +100.0°C)
Core 2:        +41.0°C  (high = +100.0°C, crit = +100.0°C)
Core 3:        +39.0°C  (high = +100.0°C, crit = +100.0°C)
Core 4:        +40.0°C  (high = +100.0°C, crit = +100.0°C)
Core 5:        +41.0°C  (high = +100.0°C, crit = +100.0°C)
Core 6:        +40.0°C  (high = +100.0°C, crit = +100.0°C)
Core 7:        +40.0°C  (high = +100.0°C, crit = +100.0°C)

BAT0-acpi-0
Adapter: ACPI interface
in0:          15.85 V  

asus-isa-0000
Adapter: ISA adapter
cpu_fan:     1700 RPM

pch_cannonlake-virtual-0
Adapter: Virtual device
temp1:        +48.0°C  

acpitz-acpi-0
Adapter: ACPI interface
temp1:        +41.0°C  (crit = +103.0°C)

this lists the cpu_fan :-)

maybe somewhere was a missing config added so it shows up now?! (it did not in my test on Dec 29, 2019)

IanMichaelHarper commented 2 years ago

Any update on this?

I'm thinking of installing a linux distro on the Zenbook duo but this is probably the biggest thing holding me back - seems like an issue that could shorten the lifetime of the laptop hardware

s-light commented 2 years ago

for me i did only get these warnings - the fan did work as expected - if i did some really heavy work with the gpu or cpu - blender benchmars / rendering 3d graphics for example - the fan starts to blow and i can hear it/feel it :-) and if the work is over it gets back to near silent and all fine temperature. (for example just currently in a ~20°C room my CPU0..7 Cores are at ~43°C and CPU fan is reported with 1700RPM by psensor - and by sensors:

$ sensors
pch_cannonlake-virtual-0
Adapter: Virtual device
temp1:        +53.0°C  

asus-isa-0000
Adapter: ISA adapter
cpu_fan:     1700 RPM

BAT0-acpi-0
Adapter: ACPI interface
in0:          15.85 V  

coretemp-isa-0000
Adapter: ISA adapter
Package id 0:  +44.0°C  (high = +100.0°C, crit = +100.0°C)
Core 0:        +44.0°C  (high = +100.0°C, crit = +100.0°C)
Core 1:        +43.0°C  (high = +100.0°C, crit = +100.0°C)
Core 2:        +42.0°C  (high = +100.0°C, crit = +100.0°C)
Core 3:        +42.0°C  (high = +100.0°C, crit = +100.0°C)
Core 4:        +42.0°C  (high = +100.0°C, crit = +100.0°C)
Core 5:        +42.0°C  (high = +100.0°C, crit = +100.0°C)
Core 6:        +42.0°C  (high = +100.0°C, crit = +100.0°C)
Core 7:        +44.0°C  (high = +100.0°C, crit = +100.0°C)

iwlwifi_1-virtual-0
Adapter: Virtual device
temp1:        +37.0°C  

nvme-pci-3c00
Adapter: PCI adapter
Composite:    +31.9°C  (low  = -273.1°C, high = +80.8°C)
                       (crit = +81.8°C)
Sensor 1:     +31.9°C  (low  = -273.1°C, high = +65261.8°C)
Sensor 2:     +31.9°C  (low  = -273.1°C, high = +65261.8°C)

acpitz-acpi-0
Adapter: ACPI interface
temp1:        +46.0°C  (crit = +103.0°C)

so i think for me it is all fine!

and currently i also do not see any Package temperature above threshold, cpu clock throttled events anymore..

Operating System: Kubuntu 21.04
KDE Plasma Version: 5.21.4
KDE Frameworks Version: 5.80.0
Qt Version: 5.15.2
Kernel Version: 5.11.0-42-generic
OS Type: 64-bit
Graphics Platform: X11
Processors: 16 × Intel® Core™ i9-9980HK CPU @ 2.40GHz
Memory: 31,2 GiB of RAM
Graphics Processor: Mesa Intel® UHD Graphics 630
IanMichaelHarper commented 2 years ago

That is good news. I did a bit of googling and it seemed like the overheating/throttling was a pretty common issue. A lot of the results weren't even linux specific so I think it might be an issue with the laptop itself more so than the OS - reports of overheating/throttling while on linux may not be because of linux

s-light commented 2 years ago

today i saw some themral warnings in the logs - the were created exactly before the suspend..

 intel_pch_thermal 0000:00:12.0: CPU-PCH current temp [64C] higher than the threshold temp [50C], sleep 1 times for 100 ms duration
[323370.147742] intel_pch_thermal 0000:00:12.0: CPU-PCH current temp [64C] higher than the threshold temp [50C], sleep 2 times for 100 ms duration
[323370.255696] intel_pch_thermal 0000:00:12.0: CPU-PCH current temp [63C] higher than the threshold temp [50C], sleep 3 times for 100 ms duration
[323370.363983] intel_pch_thermal 0000:00:12.0: CPU-PCH current temp [62C] higher than the threshold temp [50C], sleep 4 times for 100 ms duration
[323370.472043] intel_pch_thermal 0000:00:12.0: CPU-PCH current temp [63C] higher than the threshold temp [50C], sleep 5 times for 100 ms duration
[323370.579735] intel_pch_thermal 0000:00:12.0: CPU-PCH current temp [63C] higher than the threshold temp [50C], sleep 6 times for 100 ms duration
[323370.688039] intel_pch_thermal 0000:00:12.0: CPU-PCH current temp [61C] higher than the threshold temp [50C], sleep 7 times for 100 ms duration
[323370.795725] intel_pch_thermal 0000:00:12.0: CPU-PCH current temp [63C] higher than the threshold temp [50C], sleep 8 times for 100 ms duration
[323370.904385] intel_pch_thermal 0000:00:12.0: CPU-PCH current temp [62C] higher than the threshold temp [50C], sleep 9 times for 100 ms duration
[323371.012253] intel_pch_thermal 0000:00:12.0: CPU-PCH current temp [63C] higher than the threshold temp [50C], sleep 10 times for 100 ms duration
[323371.120223] intel_pch_thermal 0000:00:12.0: CPU-PCH is hot [63C] even after delay, continue to suspend. S0ix might fail
[323371.158966] ACPI: EC: interrupt blocked
[323371.355023] asus_wmi: Unknown key ea pressed
[327821.407837] asus_wmi: Unknown key e9 pressed
....
[327823.341285] Restarting tasks ... done.
[327823.350744] thermal thermal_zone9: failed to read out thermal zone (-61)

but yeah - a threshold of 50°C is very low for processors... i think it should be more something like/above 80-90°C...