Open terrillmoore opened 7 years ago
@kersing any update on this?
@johanstokking There have been improvements and code (and MultiTech builds) should be more stable. However the issue is still not 100% tackled. I'm still trying to find the cause of the remaining issue. Making progress, but very slowly as it is hard to reproduce.
I understand. I really appreciate the work on this.
Anything we, the community, can do?
I've updated the sources of several components in an attempt to solve this issue. RPi users and people creating their own builds should pull updates and rebuild.
An updated build for MultiTech is available from https://raw.github.com/kersing/multitech-installer/master/ (to update conduits running an older version just add a file '/etc/opkg/mp-feed.conf' containing one line:
src/gz mpfwd https://raw.github.com/kersing/multitech-installer/master/
run opkg install mp-packet-forwarder
to update the software. Restart /etc/init.d/ttn-pkt-forwarder restart
after updating.
I have mp-packet-forwarder (3.0.0-r14)
installed. And I get this:
00:17:18 INFO: Description configured to ""
00:17:18 INFO: [Transports] Initializing protocol for 1 servers
00:17:19 INFO: [TTN] server "192.168.0.21" connected
00:17:19 INFO: [main] Starting the concentrator
ERROR: FAIL TO CONNECT BOARD
INFO: FPGA supported features: [TX filter] [Spectral Scan]
00:17:19 ERROR: [main] failed to start the concentrator
00:17:21 *** Multi Protocol Packet Forwarder for Lora Gateway ***
Version: 3.0.10
Is this the same issue?
NOTES:
Any suggestions are more than welcomed. Thanks!
Not the same issue. Have you used https://www.thethingsnetwork.org/docs/gateways/multitech/ to install the software? If not please use those instructions and if you still have issues go to the TTN forum and request help there first, when requesting help include the full log of the software including the first lines with the version numbers. (I expect your current setup tries to use the USB version of the software where the SPI version is required.)
@kersing Alright. Thanks for your fast response.
I followed https://www.thethingsnetwork.org/docs/gateways/multitech/mlinux.html
I already manually configured the internet connection of the gateway, so I just wget
that installer.
Is there a difference between that and this? https://github.com/kersing/multitech-installer/blob/master/installer.sh
How can I tell which is which? USB vs SPI versions.
Thanks. I appreciate your help here, even if we're on a different issue thread.
For help with the installer please use the TTN forum. These issues are used for tracking software defects.
Hy all,
we are running the Packet forwarder spf_3.1.0-klk18_4.1.3-klk12_klk_wifc on a Kerlink iFemtocell. As the issue is very similar to this post I try to get some information here.
The SPF is restarted frequently, every 10 minutes to 1 hour.
in the system logs (/var/log/messages) we find: Sep 7 09:29:53 klk-wifc-040187 user.err monit[1138]: 'spf' process is not running Sep 7 09:29:53 klk-wifc-040187 user.info monit[1138]: 'spf' trying to restart Sep 7 09:29:53 klk-wifc-040187 user.info monit[1138]: 'spf' start: '/user/spf/bin/execute_spf.sh' Sep 7 09:29:58 klk-wifc-040187 daemon.debug netctl[919]: Wi-Fi scan finished Sep 7 09:29:58 klk-wifc-040187 daemon.debug netctl[919]: AP found: Funknetz2 (online-69-psk/wps-True) Sep 7 09:29:58 klk-wifc-040187 daemon.info lighttpd[1086]: 192.168.178.94 192.168.178.45 - [07/Sep/2019:09:29:58 +0000] "GET /application/administration/wlan HTTP/1.1" 200 237 "http://192.168.178.45/" "Mozilla/5.0 (Windows NT 6.1; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/76.0.3809.132 Safari/537.36" Sep 7 09:30:04 klk-wifc-040187 user.info monit[1138]: 'spf' process is running with pid 3325 Sep 7 09:37:15 klk-wifc-040187 local3.info KLK: STATM INFO CPU usage: 0.8 Sep 7 09:37:15 klk-wifc-040187 local3.info KLK: STATM INFO RAM usage: 33.3 Sep 7 09:44:37 klk-wifc-040187 user.err monit[1138]: 'spf' process is not running Sep 7 09:44:37 klk-wifc-040187 user.info monit[1138]: 'spf' trying to restart Sep 7 09:44:37 klk-wifc-040187 user.info monit[1138]: 'spf' start: '/user/spf/bin/execute_spf.sh' Sep 7 09:44:48 klk-wifc-040187 user.info monit[1138]: 'spf' process is running with pid 3353
in the spf.log we find:
Sep 7 09:29:50 klk-wifc-040187 local1.notice spf: INFO: concentrator stopped successfully Sep 7 09:29:50 klk-wifc-040187 local1.notice spf: INFO: Exiting packet forwarder program
which looks like a regular stop.
Any known reason, why the SPF ist stopped??
Best regards, Eckehard
This issue documents a problem report that first was reported on www.thethingsnetwork.org/forums: Latest MultiTech Packet Forwarder Stops Sending Packets.
The brief form: after updating to V3.0.0-r5 from V2.x, we observe hangs of the packet router code. Easiest way to detect this is to look at the timestamp on
/var/log/lora-pkt-fwd.log
:(Note the difference in time.)
The forum thread may describe additional issues; this thread only documents those observed by me. I have seen symptoms like this on other gateways, but only finally caught things in the act yesterday.
Status of the packet forwarder (from
ps au
) was:I believe that the
Sl
flags are significant.Restarting the packet forwarder does not clear the problem.
The relevant portion of
/var/log/lora-packet-fwd.log
(after the above restart) was:Rebooting the Conduit with
init 6
solved the problem.During this hang, no traffic was forwarded upstream.
Both observed failures were correlated with downlink (join) traffic. (This application otherwise does not use downlink traffic, and there are no other known applications on the gateway.)
Gateway version info:
The desperation workaround is to have a daemon watching the timestamp on the log, and if it becomes older than one or two minutes, to reboot the Conduit.