Closed by johnhcwang 8 years ago
Can you please paste the output of dmesg | grep ftgmac100?
dmesg |grep ftgmac100
ftgmac100 1e660000.ethernet: Using NCSI interface
ftgmac100 1e660000.ethernet: Read MAC address from chip 52:a6:f6:0e:6c:02
ftgmac100: NCSI interface down
I think you are hitting a race condition in the network driver. This has since been fixed in newer versions of the kernel.
I can provide a backport of the fix.
That's good news. I'll pick up your fix and verify again. Thanks.
Hi @shenki, I saw that you tagged openbmc-4.4-20160722-1 and moved to the stable kernel 4.4.15 on the v1.0-stable branch. Does it include the fix for this issue?
No, I did that before you opened this issue.
I made a proposed fix today but it was incorrect. Will try again tomorrow.
Hi @gwshan, I believe you're sending the correct fix to Joel. Could that be done this week? Thanks!
Yeah, I'm working on this and will post the fix ASAP. I guess it's the issue reported on dev-4.4, as Joel mentioned to me on IRC.
The fix requested by Joel has been sent to the openbmc mailing list and is awaiting Joel's review and pickup.
Resolved with openbmc/openbmc@d470b1f154b11b86d23438eff1f58d7f592c6d2a
Hi @shenki, I still see this issue on the latest obmc v1.0.4: I can't restart eth0, and the attempt fails with the SIOCSIFFLAGS error. The dmesg output is the same as in my previous message, so does that mean I'm still hitting the race condition?
root@barreleye:~# dmesg |grep ftgmac100
ftgmac100 1e660000.ethernet: Using NCSI interface
ftgmac100 1e660000.ethernet: Read MAC address from chip 32:c6:fa:0d:e6:ae
ftgmac100: NCSI interface down
root@barreleye:~# obmcutil state
= HOST_BOOTED
root@barreleye:~# uname -a
Linux barreleye 4.4.16-openbmc-4.4-20160804-1 #1 Tue Aug 9 17:30:01 CST 2016 armv5tejl GNU/Linux
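For anyone collecting the same information, the dmesg check above can be wrapped in a small helper. This is only a sketch: the function name ncsi_reported_down is made up for illustration, and it assumes the driver logs exactly the "NCSI interface down" string seen in the output above.

```shell
#!/bin/sh
# Hypothetical helper (name is illustrative, not part of OpenBMC):
# reads kernel log text on stdin and exits 0 if the ftgmac100 driver
# reported the NCSI interface as down.
# Assumption: the message matches the dmesg output quoted in this thread.
ncsi_reported_down() {
    grep -q 'ftgmac100: NCSI interface down'
}

# Usage on a live system:
#   dmesg | ncsi_reported_down && echo "NCSI interface is down"
```

The helper only reports whether the message is present; it cannot tell a race in the driver apart from any other reason the interface stayed down.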
Hi @johnhcwang, please open a new issue in openbmc/openbmc and add a reference to this issue. Thanks.
Hi @johnhcwang, please let me know the best way to contact you (IRC, time zone, email, etc.) so that I can understand more about the issue. Thanks! I currently don't have access to a Barreleye.
I had a talk with @johnhcwang. The likely story is this: no NCSI channels are probed from the BCM5719 when the network interface is brought up for the first time. Because NCSI channel enumeration is done only once, the network interface never gets a workable NCSI device associated with it, so it won't work later on either.
I will provide a tentative patch for @johnhcwang to try as discussed, thanks!
I'm not familiar with network initialization, so I've attached the journal log and captured some strange messages for your information. Any suggestions that could help me debug this issue are welcome.
When this issue occurred, there were some error messages from networkd, as shown below. lan_fail.txt
I then tried to restart the eth0 device, but got the SIOCSIFFLAGS error. lan_fail_1.txt
The journal log shows that it can't bring up this interface.
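The restart attempt and log capture described above could be scripted roughly as follows. This is a sketch, not the exact procedure used here: it assumes ifconfig is the tool that printed the SIOCSIFFLAGS error and that the networkd messages come from a unit named systemd-networkd. The DRY_RUN switch and the function names are made up for illustration.

```shell
#!/bin/sh
# Hypothetical wrapper: with DRY_RUN set, print the command instead of
# running it, so the sequence can be reviewed off the BMC.
run() {
    if [ -n "${DRY_RUN:-}" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Bounce the interface, then dump the kernel log and the networkd journal.
# Assumptions: ifconfig is available and the unit is systemd-networkd.
bounce_and_collect() {
    iface="${1:-eth0}"
    run ifconfig "$iface" down
    run ifconfig "$iface" up      # the step that failed with SIOCSIFFLAGS
    run dmesg                     # filter with grep ftgmac100 as needed
    run journalctl -b -u systemd-networkd --no-pager
}
```

Calling bounce_and_collect eth0 as root on the BMC without DRY_RUN set runs the commands for real; redirecting the output to a file gives a single log to attach to the new issue.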