After upgrading to the 4.9 kernel line, my weather station keeps locking up after 2 or 3 days.
Sometimes I cannot log in after this, but now I managed to scrape a few details.
No space/ memory/ swap issues. Used to work in same setup for over two years (with occasional Raspbian updates and rpi-update kernel updates). Moved to btrfs filesystem at around kernel 4.5 and it was fine until 4.9 came along. (Note, swap is on a separate partition.)
Any ideas how to approach this? Would be happy to try out a few things. It seems to be ok after a reboot. Will update to latest kernel 4.9.16 now.
Best regards,
Max
pi@raspberrypi ~ $ uname -a
Linux raspberrypi 4.9.14+ #977 Mon Mar 13 18:21:04 GMT 2017 armv6l GNU/Linux
This repo is just for the rpi-update utility. Could you recreate the issue here:
https://github.com/raspberrypi/linux/issues
where it's more likely to be seen by the right people.
After upgrading to the 4.9 kernel line, my weather station keeps locking up after 2 or 3 days. Sometimes I cannot log in after this, but now I managed to scrape a few details.
No space/ memory/ swap issues. Used to work in same setup for over two years (with occasional Raspbian updates and rpi-update kernel updates). Moved to btrfs filesystem at around kernel 4.5 and it was fine until 4.9 came along. (Note, swap is on a separate partition.)
Any ideas how to approach this? Would be happy to try out a few things. It seems to be ok after a reboot. Will update to latest kernel 4.9.16 now.
Best regards, Max
pi@raspberrypi ~ $ uname -a Linux raspberrypi 4.9.14+ #977 Mon Mar 13 18:21:04 GMT 2017 armv6l GNU/Linux
dmesg except: (Tofu below are square brackets)
[233136.361008] octave-cli: page allocation stalls for 10660ms, order:2, mode:0x24000c0(GFP_KERNEL) [233136.370269] CPU: 0 PID: 21214 Comm: octave-cli Not tainted 4.9.14+ #977 [233136.377216] Hardware name: BCM2835 [233136.380868] [] (unwind_backtrace) from [] (show_stack+0x20/0x24)
[233136.388960] [] (show_stack) from [] (dump_stack+0x20/0x28)
[233136.396536] [] (dump_stack) from [] (warn_alloc+0xf8/0x120)
[233136.404164] [] (warn_alloc) from [] (alloc_pages_nodemask+0xaec/0xd04)
[233136.415671] [] ( alloc_pages_nodemask) from [] (get_free_pages+0x20/0x70)
[233136.430524] [] ( get_free_pages) from [] (pgd_alloc+0x24/0x104)
[233136.444421] [] (pgd_alloc) from [] (mm_init+0xcc/0x168)
[233136.454699] [] (mm_init) from [] (copy_process.part.5+0x968/0x16c8)
[233136.468976] [] (copy_process.part.5) from [] (_do_fork+0xb4/0x3a0)
[233136.483236] [] (_do_fork) from [] (SyS_clone+0x30/0x38)
[233136.493652] [] (SyS_clone) from [] (ret_fast_syscall+0x0/0x1c)
[233136.507669] Mem-Info:
[233136.513262] active_anon:7030 inactive_anon:7049 isolated_anon:1
active_file:19465 inactive_file:17091 isolated_file:31
unevictable:0 dirty:54 writeback:3 unstable:0
slab_reclaimable:2189 slab_unreclaimable:1903
mapped:10587 shmem:2810 pagetables:360 bounce:0
free:590 free_pcp:0 free_cma:0
[233136.564313] Node 0 active_anon:28120kB inactive_anon:28196kB active_file:77860kB inactive_file:68364kB unevictable:0kB isolated(anon):4kB isolated(file):124kB mapped:42348kB dirty:216kB writeback:12kB shmem:11240kB writeback_tmp:0kB unstable:0kB pages_scanned:0 all_unreclaimable? no
[233136.600418] Normal free:2360kB min:1868kB low:2332kB high:2796kB active_anon:28120kB inactive_anon:28196kB active_file:77860kB inactive_file:68364kB unevictable:0kB writepending:228kB present:245760kB managed:233620kB mlocked:0kB slab_reclaimable:8756kB slab_unreclaimable:7612kB kernel_stack:1760kB pagetables:1440kB bounce:0kB free_pcp:0kB local_pcp:0kB free_cma:0kB
[233136.645890] lowmem_reserve[]: 0 0
[233136.651882] Normal: 5544kB (UM) 188kB (UM) 016kB 032kB 064kB 0128kB 0256kB 0512kB 01024kB 02048kB 0*4096kB = 2360kB
[233136.668449] 39747 total pagecache pages
[233136.673433] 368 pages in swap cache
[233136.679284] Swap cache stats: add 9416, delete 9048, find 187356/188426
[233136.688374] Free swap = 250180kB
[233136.694109] Total swap = 265212kB
[233136.699709] 61440 pages RAM
[233136.704778] 0 pages HighMem/MovableOnly
[233136.710806] 3035 pages reserved
[233136.716150] 2048 pages cma reserved
culprit is 21214 octave-cli
pi@raspberrypi ~ $ ps aux --forest|cat USER PID %CPU %MEM VSZ RSS TTY STAT START TIME COMMAND root 2 0.0 0.0 0 0 ? S Mar16 0:00 [kthreadd] root 3 0.0 0.0 0 0 ? S Mar16 1:26 _ [ksoftirqd/0] root 5 0.0 0.0 0 0 ? S< Mar16 0:00 _ [kworker/0:0H] root 7 0.0 0.0 0 0 ? S< Mar16 0:00 _ [lru-add-drain] root 8 0.0 0.0 0 0 ? S Mar16 0:00 _ [kdevtmpfs] root 9 0.0 0.0 0 0 ? S< Mar16 0:00 _ [netns] root 10 0.0 0.0 0 0 ? S Mar16 0:00 _ [khungtaskd] root 11 0.0 0.0 0 0 ? S Mar16 0:00 _ [oomreaper] root 12 0.0 0.0 0 0 ? S< Mar16 0:00 _ [writeback] root 13 0.0 0.0 0 0 ? S Mar16 0:03 _ [kcompactd0] root 14 0.0 0.0 0 0 ? S< Mar16 0:00 _ [crypto] root 15 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 16 0.0 0.0 0 0 ? S< Mar16 0:00 _ [kblockd] root 17 0.0 0.0 0 0 ? S< Mar16 0:00 _ [watchdogd] root 19 0.0 0.0 0 0 ? S< Mar16 0:00 _ [rpciod] root 20 0.0 0.0 0 0 ? S< Mar16 0:00 _ [xprtiod] root 23 0.0 0.0 0 0 ? S Mar16 1:13 _ [kswapd0] root 24 0.0 0.0 0 0 ? S< Mar16 0:00 _ [nfsiod] root 34 0.0 0.0 0 0 ? S< Mar16 0:00 _ [kthrotld] root 35 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 36 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 37 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 38 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 39 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 40 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 41 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 42 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 43 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 44 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 45 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 46 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 47 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 48 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 49 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 50 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 51 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 52 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 53 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 54 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 55 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 56 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 57 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 58 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 59 0.0 0.0 0 0 ? S< Mar16 0:00 \ [iscsieh] root 60 0.0 0.0 0 0 ? S< Mar16 0:00 \ [dwcotg] root 61 0.0 0.0 0 0 ? S< Mar16 0:00 _ [DWC Notificatio] root 62 0.0 0.0 0 0 ? S< Mar16 0:00 _ [VCHIQ-0] root 63 0.0 0.0 0 0 ? S< Mar16 0:00 _ [VCHIQr-0] root 64 0.0 0.0 0 0 ? S< Mar16 0:00 _ [VCHIQs-0] root 65 0.0 0.0 0 0 ? S Mar16 0:00 _ [VCHIQka-0] root 77 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 80 0.0 0.0 0 0 ? S Mar16 3:15 _ [mmcqd/0] root 112 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 141 0.0 0.0 0 0 ? S< Mar16 0:00 _ [btrfs-worker] root 142 0.0 0.0 0 0 ? S< Mar16 0:00 _ [kworker/u3:0] root 143 0.0 0.0 0 0 ? S< Mar16 0:00 _ [btrfs-worker-hi] root 144 0.0 0.0 0 0 ? S< Mar16 0:00 _ [btrfs-delalloc] root 145 0.0 0.0 0 0 ? S< Mar16 0:00 \ [btrfs-flushdel] root 146 0.0 0.0 0 0 ? S< Mar16 0:00 _ [btrfs-cache] root 147 0.0 0.0 0 0 ? S< Mar16 0:00 _ [btrfs-submit] root 148 0.0 0.0 0 0 ? S< Mar16 0:00 _ [btrfs-fixup] root 149 0.0 0.0 0 0 ? S< Mar16 0:00 _ [btrfs-endio] root 150 0.0 0.0 0 0 ? S< Mar16 0:00 _ [btrfs-endio-met] root 151 0.0 0.0 0 0 ? S< Mar16 0:00 _ [btrfs-endio-met] root 152 0.0 0.0 0 0 ? S< Mar16 0:00 _ [btrfs-endio-rai] root 153 0.0 0.0 0 0 ? S< Mar16 0:00 _ [btrfs-endio-rep] root 154 0.0 0.0 0 0 ? S< Mar16 0:00 _ [btrfs-rmw] root 155 0.0 0.0 0 0 ? S< Mar16 0:00 _ [btrfs-endio-wri] root 156 0.0 0.0 0 0 ? S< Mar16 0:00 _ [btrfs-freespace] root 157 0.0 0.0 0 0 ? S< Mar16 0:00 _ [btrfs-delayed-m] root 158 0.0 0.0 0 0 ? S< Mar16 0:00 _ [btrfs-readahead] root 159 0.0 0.0 0 0 ? S< Mar16 0:00 _ [btrfs-qgroup-re] root 160 0.0 0.0 0 0 ? S< Mar16 0:00 _ [btrfs-extent-re] root 161 0.0 0.0 0 0 ? S Mar16 0:00 _ [btrfs-cleaner] root 162 0.1 0.0 0 0 ? S Mar16 4:46 _ [btrfs-transacti] root 178 0.0 0.0 0 0 ? S< Mar16 0:02 _ [kworker/0:1H] root 181 0.0 0.0 0 0 ? S< Mar16 0:00 \ [ipv6addrconf] root 211 0.9 0.0 0 0 ? S Mar16 40:25 \ [w1_busmaster1] root 294 0.0 0.0 0 0 ? S Mar16 0:00 _ [spi0] root 380 0.0 0.0 0 0 ? S< Mar16 0:00 _ [cfg80211] root 386 0.0 0.0 0 0 ? S< Mar16 0:58 _ [kworker/u3:1] root 467 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 468 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 469 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 470 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 471 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 472 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 473 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 474 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 475 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 476 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 477 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 478 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 479 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 480 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 481 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 482 0.0 0.0 0 0 ? S< Mar16 0:00 _ [bioset] root 19619 0.0 0.0 0 0 ? S 11:55 0:03 _ [kworker/u2:5] root 21696 0.0 0.0 0 0 ? D 12:30 0:04 | _ [kworker/u2:5] root 22160 0.0 0.0 0 0 ? S 18:12 0:00 _ [kworker/u2:0] root 22178 0.0 0.0 0 0 ? S 18:30 0:01 _ [kworker/u2:2] root 22193 0.1 0.0 0 0 ? S 18:43 0:02 _ [kworker/0:3] root 22196 0.1 0.0 0 0 ? S 18:48 0:01 _ [kworker/0:2] root 22205 0.0 0.0 0 0 ? S 18:58 0:00 _ [kworker/u2:4] root 22271 0.1 0.0 0 0 ? S 19:07 0:00 _ [kworker/0:0] root 22274 0.0 0.0 0 0 ? S 19:10 0:00 _ [kworker/u2:1] root 22280 0.0 0.0 0 0 ? S 19:14 0:00 \ [kworker/0:1] root 1 0.0 1.4 5464 3396 ? Ss Mar16 0:26 /sbin/init root 224 0.0 0.8 8104 2092 ? Ss Mar16 2:07 /lib/systemd/systemd-journald root 230 0.0 0.6 11988 1468 ? Ss Mar16 0:00 /lib/systemd/systemd-udevd root 511 0.0 1.2 7144 2816 ? Ss Mar16 0:23 /sbin/wpa_supplicant -s -B -P /run/wpa_supplicant.wlan0.pid -i wlan0 -W -D nl80211,wext -c /etc/wpa_supplicant/wpasupplicant.conf root 568 0.0 1.0 31240 2352 ? Ssl Mar16 0:25 /usr/sbin/rsyslogd -n root 571 0.0 0.9 7856 2300 ? Ss Mar16 0:08 /usr/sbin/sshd -D root 21690 0.0 1.8 9200 4424 ? Ds 12:29 0:05 _ sshd: [accepted]
root 21616 0.0 1.1 7016 2672 ? S 12:29 0:00 _ /usr/sbin/CRON -f
root 21625 0.0 0.4 1900 1116 ? Ss 12:29 0:00 | _ /bin/sh -c cd /root && ./sensorcron
root 21629 0.0 1.1 4604 2648 ? S 12:29 0:00 | _ /bin/bash ./sensorcron
root 21676 0.0 0.7 4604 1820 ? D 12:29 0:04 | _ /bin/bash ./sensorcron
root 21695 0.0 1.0 7016 2528 ? D 12:30 0:00 _ /usr/sbin/CRON -f
pi 22206 0.0 1.4 4964 3320 ? Ss 18:58 0:00 /lib/systemd/systemd --user
pi 22209 0.0 0.8 7352 2056 ? S 18:58 0:00 \ (sd-pam)
root 22201 0.0 2.3 12600 5472 ? Ss 18:58 0:00 _ sshd: pi [priv]
pi 22215 0.1 1.5 12740 3632 ? S 18:58 0:01 _ sshd: pi@pts/0
pi 22217 0.2 1.9 6340 4508 pts/0 Ss 18:58 0:02 _ -bash pi 22284 0.0 0.9 4884 2316 pts/0 R+ 19:16 0:00 _ ps aux --forest pi 22285 0.0 0.6 3540 1452 pts/0 S+ 19:16 0:00 _ cat root 576 0.0 1.3 40200 3264 ? Ssl Mar16 0:00 /usr/sbin/ModemManager avahi 577 0.0 0.9 3864 2132 ? Ss Mar16 0:04 avahi-daemon: running [raspberrypi-2.local] avahi 626 0.0 0.3 3864 824 ? S Mar16 0:00 \ avahi-daemon: chroot helper root 585 0.0 0.8 3844 2092 ? Ss Mar16 0:03 /lib/systemd/systemd-logind message+ 606 0.0 0.9 5736 2164 ? Ss Mar16 0:01 /usr/bin/dbus-daemon --system --address=systemd: --nofork --nopidfile --systemd-activation nobody 620 0.0 0.4 2280 1140 ? Ss Mar16 0:05 /usr/sbin/thd --daemon --triggers /etc/triggerhappy/triggers.d/ --socket /var/run/thd.socket --pidfile /var/run/thd.pid --user nobody /dev/input/event* ntp 633 0.0 0.8 5676 2064 ? Ss Mar16 0:53 /usr/sbin/ntpd -p /var/run/ntpd.pid -g -u 102:104 root 639 0.0 0.6 12368 1496 ? Ss Mar16 0:00 /usr/sbin/cupsd -f root 641 0.0 0.9 10204 2316 ? Ss Mar16 0:00 /usr/sbin/cups-browsed root 653 0.0 1.5 39644 3604 ? Ssl Mar16 0:00 /usr/lib/policykit-1/polkitd --no-debug root 666 0.0 0.5 4040 1232 tty1 Ss+ Mar16 0:00 /sbin/agetty --noclear tty1 linux root 667 0.0 0.5 3860 1236 ? Ss+ Mar16 0:00 /sbin/agetty --keep-baud 115200 38400 9600 ttyAMA0 vt102 www-data 693 0.0 0.9 8140 2176 ? Ss Mar16 0:23 /usr/sbin/lighttpd -D -f /etc/lighttpd/lighttpd.conf root 713 0.0 0.5 2576 1228 ? Ss Mar16 0:11 /sbin/wpa_cli -B -P /run/wpa_action.wlan0.pid -i wlan0 -p /var/run/wpa_supplicant -a /sbin/wpaaction root 751 0.0 0.7 8716 1664 ? Ss Mar16 0:00 dhclient -v -pf /run/dhclient.wlan0.pid -lf /var/lib/dhcp/dhclient.wlan0.leases wlan0 root 865 0.0 1.7 30616 3984 ? Ssl Mar16 0:00 /usr/sbin/console-kit-daemon --no-daemon root 2656 0.0 0.7 5088 1648 ? Ds Mar16 0:18 /usr/sbin/cron -f root 21191 0.0 1.1 7016 2672 ? S 12:19 0:00 _ /usr/sbin/CRON -f root 21204 0.0 0.4 1900 1104 ? Ss 12:19 0:00 | _ /bin/sh -c cd /root && ./sensorplots root 21206 0.0 1.1 4604 2636 ? S 12:19 0:00 | _ /bin/bash ./sensorplots root 21214 0.0 15.6 126792 36612 ? D 12:19 0:13 | _ /usr/bin/octave-cli -H /root/sensorplots-20170319.m root 21614 0.0 0.0 0 0 ? Z 12:29 0:00 _ [cron]