coolsnowwolf / lede

Lean's LEDE source
Other
29.61k stars 19.51k forks source link

【已上传拆机图】R2S新的一批有内核奔溃问题,11月后新出厂的R2S貌似都有这个问题(已测试两个有此问题) #11692

Closed chobitsos closed 9 months ago

chobitsos commented 10 months ago

1.关于你要提交的问题

新款R2S (11月出厂后大孔指示灯版本)内核奔溃重启

Q:是否搜索了issue (使用 "x" 选择)

2. 详细叙述

最近购买的R2S大概是11月份出厂的,板子貌似有些变化,最直接的变化就是指示灯孔变大,旧款还是小孔的指示灯 在 https://github.com/coolsnowwolf/lede/commit/a8ddfc2e1025e2f35099b8757c7cf581cc87716b 此版本编译出来的固件刷入后 大约运行1小时左右开始内核奔溃并且自动重启 或流量大CPU大约满载时内核奔溃重启 然而旧版R2S(小孔指示灯版本)没有这个问题 从拿货渠道打听到友善订了1W片新的板子,并且设计上有些改变,最直观的就是指示灯变大孔

(1) 具体问题

A:

大约运行1小时左右内核奔溃并且自动重启 大约运行1小时左右内核奔溃死机 流量大时内核奔溃重启 流量大时内核奔溃死机 CPU满载时内核奔溃重启

(2) 路由器型号和固件版本

A:

R2S新款11月份出厂后版本(指示灯大孔版本) 编译版本 https://github.com/coolsnowwolf/lede/commit/a8ddfc2e1025e2f35099b8757c7cf581cc87716b

拆机图: IMG_E0922

(3) 详细日志

A:

↓↓↓↓↓完整Kernel Log↓↓↓↓↓ r2s_Kernel_Log.txt ↑↑↑↑↑完整Kernel Log↑↑↑↑↑

↓↓↓↓↓完整System Log↓↓↓↓↓ r2s_error.txt ↑↑↑↑↑完整System Log↑↑↑↑↑

[   27.591053] Unable to handle kernel paging request at virtual address ffd8b7f8ff525498
[   27.591787] Mem abort info:
[   27.592035]   ESR = 0x0000000096000004
[   27.592367]   EC = 0x25: DABT (current EL), IL = 32 bits
[   27.592834]   SET = 0, FnV = 0
[   27.593102]   EA = 0, S1PTW = 0
[   27.593382]   FSC = 0x04: level 0 translation fault
[   27.593812] Data abort info:
[   27.594124]   ISV = 0, ISS = 0x00000004
[   27.594468]   CM = 0, WnR = 0
[   27.594733] [ffd8b7f8ff525498] address between user and kernel address ranges
[   27.595359] Internal error: Oops: 0000000096000004 [#1] PREEMPT SMP
[   27.595911] Modules linked in: pppoe ppp_async iptable_nat xt_state xt_nat xt_conntrack xt_REDIRECT xt_MASQUERADE xt_FLOWOFFLOAD xt_CT pppox ppp_mppe ppp_generic nft_redir nft_ct nf_nat_tftp nf_nat_snmp_basic nf_nat_sip nf_nat_pptp nf_nat_irc nf_nat_h323 nf_nat_ftp nf_nat_amanda nf_nat nf_flow_table nf_conntrack_tftp nf_conntrack_snmp nf_conntrack_sip nf_conntrack_pptp nf_conntrack_irc nf_conntrack_h323 nf_conntrack_ftp nf_conntrack_broadcast nf_conntrack_bridge nf_conntrack_amanda nf_conntrack lzo ipt_REJECT cdc_ncm cdc_ether xt_time xt_tcpudp xt_quota xt_pkttype xt_owner xt_multiport xt_mark xt_mac xt_limit xt_comment xt_cgroup xt_addrtype xt_TPROXY xt_TCPMSS xt_LOG usbnet ums_usbat ums_sddr55 ums_sddr09 ums_karma ums_jumpshot ums_isd200 ums_freecom ums_datafab ums_cypress ums_alauda ts_kmp ts_fsm ts_bm tcp_bbr slhc r8152 nft_reject_ipv6 nft_reject_ipv4 nft_reject_inet nft_reject_bridge nft_reject nft_quota nft_objref nft_numgen nft_meta_bridge nft_log nft_limit nft_hash nft_counter
[   27.596118]  nf_tproxy_ipv6 nf_tproxy_ipv4 nf_tables nf_reject_ipv6 nf_reject_ipv4 nf_log_syslog nf_defrag_ipv6 nf_defrag_ipv4 macvlan lzo_rle lzo_decompress lzo_compress iptable_raw iptable_mangle iptable_filter ip_tables crc_ccitt br_netfilter asn1_decoder ntfs3 cryptodev xt_set x_tables ip_set_list_set ip_set_hash_netportnet ip_set_hash_netport ip_set_hash_netnet ip_set_hash_netiface ip_set_hash_net ip_set_hash_mac ip_set_hash_ipportnet ip_set_hash_ipportip ip_set_hash_ipport ip_set_hash_ipmark ip_set_hash_ipmac ip_set_hash_ip ip_set_bitmap_port ip_set_bitmap_ipmac ip_set_bitmap_ip ip_set nfnetlink tun nls_utf8 nls_cp437 crypto_user algif_skcipher algif_rng algif_hash algif_aead af_alg sha256_generic sha1_generic ecb authencesn authenc arc4 crypto_acompress uas gpio_button_hotplug vfat fat exfat
[   27.609920] CPU: 3 PID: 2377 Comm: sh Not tainted 5.15.136 #0
[   27.610431] Hardware name: FriendlyElec NanoPi R2S (DT)
[   27.610892] pstate: a0000005 (NzCv daif -PAN -UAO -TCO -DIT -SSBS BTYPE=--)
[   27.611505] pc : bsearch+0x58/0xc0
[   27.611824] lr : bsearch+0x50/0xc0
[   27.612126] sp : ffff80000bce3aa0
[   27.612417] x29: ffff80000bce3aa0 x28: ffff000000968000 x27: 0000000045585401
[   27.613052] x26: 0000000000000002 x25: ffff8000083f1590 x24: ffff80000bce3b08
[   27.613684] x23: ffff800008af7850 x22: 0000000000000008 x21: 0000000000000019
[   27.614316] x20: ffff800008af7918 x19: 0000000000000031 x18: 0000000000000000
[   27.614949] x17: 0000000000000001 x16: 0000000000000001 x15: 01b72a580051f4ea
[   27.615578] x14: 0000000000000011 x13: 0000000000000000 x12: ffff00000411f770
[   27.616210] x11: 0000000000000040 x10: ffff00000411f6c0 x9 : ffff800008cbfed8
[   27.616836] x8 : ffff80000bce3e20 x7 : 0000000000000000 x6 : 0000000000000000
[   27.617468] x5 : 000000000000032e x4 : ffff8000083f1590 x3 : 0000000000000008
[   27.618100] x2 : ffffffffff525498 x1 : ffff800008af79e0 x0 : ffd8b7f8ff525498
[   27.618736] Call trace:
[   27.618958]  bsearch+0x58/0xc0
[   27.619232]  search_extable+0x30/0x3c
[   27.619562]  search_exception_tables+0x38/0x70
[   27.619963]  do_page_fault+0x2e4/0x380
[   27.620302]  do_mem_abort+0x44/0xb0
[   27.620614]  el1_abort+0x40/0x64
[   27.620905]  el1h_64_sync_handler+0xd8/0xe4
[   27.621274]  el1h_64_sync+0x74/0x78
[   27.621585]  do_notify_resume+0x458/0x120c
[   27.621955]  el0_svc+0x44/0x54
[   27.622228]  el0t_64_sync_handler+0xa4/0x130
[   27.622602]  el0t_64_sync+0x180/0x184
[   27.622933] Code: aa1403e1 d63f0320 7100001f 34000220 (540001ad) 
[   27.623472] ---[ end trace cea385c85b6cb03b ]---

↓↓↓↓↓完整Kernel Log↓↓↓↓↓ r2s_Kernel_Log.txt ↑↑↑↑↑完整Kernel Log↑↑↑↑↑

↓↓↓↓↓完整System Log↓↓↓↓↓ r2s_error.txt ↑↑↑↑↑完整System Log↑↑↑↑↑

coolsnowwolf commented 10 months ago

超频后不稳定了

kxdn3 commented 10 months ago

U不行

chobitsos commented 10 months ago

超频后不稳定了

没有超频

chobitsos commented 10 months ago

U不行

但是旧款没有这个问题

coolsnowwolf commented 10 months ago

初步怀疑新的设计可能需要更高的 CPU 电压 #3f6bb72ee84d8730155669b484e9dad828832951 提交了个 patch 拉高0.05v 电压,你试试

chobitsos commented 10 months ago

初步怀疑新的设计可能需要更高的 CPU 电压 #3f6bb72ee84d8730155669b484e9dad828832951 提交了个 patch 拉高0.05v 电压,你试试

还是不行,貌似更严重了,随便打开个测速网站内核就奔了,流量稍微大一丢丢就内核奔溃重启了 都不带死机的,直接重启了 打开抖音不停的刷视频10个内必内核奔溃重启😂😂 不知道会不会是更换了板载闪存频率或者动了那一部分的板子逻辑导致,官方sdk也没更新什么😳

另外,实时输出日志有两个地方奔溃的地方貌似都跟CPU有关

CPU: 3 PID: 2167 Comm: ssr-redir Tainted: G        W         5.15.136 #0
WARNING: CPU: 3 PID: 2167 at net/ipv4/tcp_input.c:2981 tcp_fastretrans_alert+0x600/0x920

只要打开fast.com测速就能触发上面的奔溃信息,好像是ssr-redir一占用CPU就挂,当然我也测试其他占用CPU的软件 比如跑分软件,也是同样内核奔溃,看来奔溃都有个共同点就是一占用CPU就挂

↓↓↓↓此为实时输出日志↓↓↓↓ r2s_logread-f_logs.txt

r2s_Kernel_logs.txt r2s_System_logs.txt

板子已拆,右上角的一个芯片明显跟旧款不同,由于官方没有标注这颗芯片,所以不得而知这颗是什么,不会看电路图

官方芯片标注图 官方电路原理图

拿货渠道商说新板子有改动,并且更挑TF卡了,所以本来配32G的TF换了64G TF我也测试好几张,旧款机器测试无问题的TF卡放到新款上一样有问题,所以应该不是TF卡导致

IMG_E0922 IMG_0925 IMG_0922

chobitsos commented 10 months ago

IMG_E0922

还是找到了不一样的地方,由于官方没有标注这颗芯片,所以不得而知这颗是什么,不会看电路图 官方芯片标注图 官方电路原理图