Closed rethil closed 9 years ago
This looks more serious indeed. Most of the stuff at pastebin comes from realtek driver, though. I hope it's not that driver which is causing problems. I will take a closer look at this once I'm done with the other issue you reported ;) Would it be a problem for you to upgrade your kernel to v4.1 or even https://git.kernel.org/cgit/linux/kernel/git/kvalo/wireless-drivers-next.git/ and see if those errors occur again? It's not super necessary but may make things easier for me down the road.
I'll upgrade my kernel to v4.1 this evening and see if I can see these errors again.
OK, the first one is fixed in testing
branch as well. The upgrade is not 100% necessary so if you don't feel like doing it, you can keep rollin' with 4.0.5.
I'm trying to reproduce the second one with no luck so far.
Third one is complete hung of the card. Little chance I will be able to fix that one (as it would probably require MediaTek to change the firmware).
I was unable to upgrade my system with 4.1 kernel. Fix for tx/rx seems to be doing it job on v4.0.5. Second error was easly reproducable as far as I remember. I'll try to reproduce it with testing branch. For 3rd: it only happened once while running for 1 night straight.
Actually the initial fix for first issue which I pushed earlier killed performance of TCP from the host (<5Mbps). I just updated the testing
branch to fix that. Can you try that? Let me know if you see any issues. Your help is much appreciated :)
As for the second and third problem I created a setup with NAT (ethernet <-> mt7601u) but no luck reproducing them so far...
Tried out latest commit: there are some kernel messages popping out. I just simply ran: wget -O /dev/null http://speedtest.wdc01.softlayer.com/downloads/test100.zip
[ 32.327978] mt7601u 1-1.2:1.0 wlan1: disabling HT/VHT due to WEP/TKIP use
[ 32.328253] mt7601u 1-1.2:1.0 wlan1: disabling HT as WMM/QoS is not supported by the AP
[ 32.328580] mt7601u 1-1.2:1.0 wlan1: disabling VHT as WMM/QoS is not supported by the AP
[ 131.699802] wget (307) used greatest stack depth: 2768 bytes left
[ 163.345903] mt7601u 1-1.2:1.0: Error: BBP read 31 failed:-110!!
[ 163.701233] mt7601u 1-1.2:1.0: Error: BBP read 31 failed:-110!!
[ 164.076691] mt7601u 1-1.2:1.0: Error: RF read 00:04 failed:-110!!
[ 172.337635] mt7601u 1-1.2:1.0: Error: BBP read 2f failed:-110!!
[ 180.589873] mt7601u 1-1.2:1.0: Error: BBP read 31 failed:-110!!
[ 193.183838] mt7601u 1-1.2:1.0: Error: BBP read 42 failed:-110!!
[ 197.456808] mt7601u 1-1.2:1.0: Error: BBP read 42 failed:-110!!
[ 201.746887] mt7601u 1-1.2:1.0: Error: BBP read 31 failed:-110!!
[ 210.236258] mt7601u 1-1.2:1.0: Error: BBP read 42 failed:-110!!
[ 239.433631] mt7601u 1-1.2:1.0: Error: BBP read 2f failed:-110!!
[ 247.921059] mt7601u 1-1.2:1.0: Error: BBP read b2 failed:-110!!
[ 260.310126] mt7601u 1-1.2:1.0: Error: BBP read 42 failed:-110!!
[ 264.656379] mt7601u 1-1.2:1.0: Error: BBP read 42 failed:-110!!
[ 273.006451] mt7601u 1-1.2:1.0: Error: BBP read 31 failed:-110!!
[ 277.236589] mt7601u 1-1.2:1.0: Error: BBP read 31 failed:-110!!
[ 277.348404] mt7601u 1-1.2:1.0: Error: BBP read 01 failed:-110!!
[ 302.577326] mt7601u 1-1.2:1.0: Error: BBP read 31 failed:-110!!
[ 310.919364] mt7601u 1-1.2:1.0: Error: BBP read 31 failed:-110!!
[ 326.488685] wget (310) used greatest stack depth: 2112 bytes left
Traffic was getting lower when each of error message was showing up.
I can also see a traffic loss between 2223538 and e5dc358. 2223538 - ~1mb/s constant traffic e5dc358 - ~0.5-0.8mb/s and usually dropping to 10kb/s
I was able to reproduce the timeouts (-110 errors). I will try to reproduce the traffic drop between 2223538 and e5dc358 in my setup as well... but tomorrow. I have enough debugging for one day ;)
Looks like rtl8188 were caused by hostapd. I was using https://github.com/jenssegers/RTL8188-hostapd. Now I switched to hostapd 2.4 patched with https://github.com/pritambaral/hostapd-rtl871xdrv and it's more stable. Kernel still shows error messages not related to mt7601u. You can get that issue of the list. 2nd issue: I'm also not able to reproduce it now.
OK @Rethil! Big thanks for report and help, I think doing proper locking fixed the -110 errors
. I've been running mt7601u in NAT configuration for almost 20 hours with no hickups/errors.
Moreover I don't see any performance issues. master
and testing
run at approximately the same speed for me. Let me know if speed regression persists in your configuration. Beware that speed depends on many factors like noise (other users of 2.4GHz band), temperature of the device, placement of objects around the devices, weather you are standing next to it or not...
I pushed the fixes to master
branch. Feel free to open a new issue if you have more problems.
Great, thanks a lot ;). Everything seems to be working fine with dirver from testing branch. If anything I spot something bad I'll let you know :+1:
There are a lot of kernel messages while stressing out mt7601u device. My setup has 2 usb wifi dongles:
I was able to capture few backtraces. Errors showed up while downloading/uploading bigger ammount of data.
Additionally when I turned on some kernel debug features this showed up at somepoint: http://pastebin.com/V591G8Fg