Closed sdvillal closed 8 years ago
If you updated to 0.6.8 flydra then you will also need to update ros_flydra
@etiennecampione can you confirm this is new (did not happen before)?
Also: is this with the version of flydra which I just packaged and released a couple weeks ago? (What is the output of python -c "import flydra.version; print flydra.version.__version__"
?) Is this on a specific flycube?
The version 0.6.8 of flydra is installed (it is the output of "python -c "import flydra.version; print flydra.version.version"") and ros_flydra is updated ("git pull --ff-only") and re-built ("make clean", then "make"). I confirm that I did not have this problem before. I currently see it on both Flycubes 3 and 8 which would need a new calibration.
Yes, then update to git master of ros_flydra. Something like
roscd ros_flydra
git fetch
git merge origin/master
Please close this bug if that solves it for you.
@astraw: I already was on the last commit of the ros_flydra repository, so these command lines did not change anything and alas could not solve the problem. The last calibration I made was on the 17th December 2015, and I did not notice this problem at this time. So I fear it came with the new version of flydra.
Oh, yeah, you also have to make the latest ros_flydra
rosmake -s ros_flydra
Did you do that, too?
Relevant logs on the master side:
[rosout][INFO] 2016-02-09 11:47:22,196: received log message from Basler_21418139: excellent, grab thread running in maximum prioity mode
[rosout][INFO] 2016-02-09 11:47:22,202: received log message from Basler_21425980: using cam_iface driver: mega (wrapper: ctypes)
[rosout][INFO] 2016-02-09 11:47:22,210: received log message from Basler_21425975: using cam_iface driver: mega (wrapper: ctypes)
[rosout][INFO] 2016-02-09 11:47:22,211: received log message from Basler_21425975: excellent, grab thread running in maximum prioity mode
[rosout][INFO] 2016-02-09 11:47:22,214: received log message from Basler_21418139: using cam_iface driver: mega (wrapper: ctypes)
[rosout][INFO] 2016-02-09 11:47:22,421: received log message from Basler_21418139: Basler_21418139 frames apparently skipped: 56 (168 vs 111)
[rosout][INFO] 2016-02-09 11:47:22,421: received log message from Basler_21425990: Basler_21425990 frames apparently skipped: 98 (169 vs 70)
[rosout][INFO] 2016-02-09 11:47:22,422: received log message from Basler_21418139: Basler_21418139 frames apparently skipped: 50 (111 vs 60)
[rosout][INFO] 2016-02-09 11:47:22,429: received log message from Basler_21425975: Basler_21425975 frames apparently skipped: 48 (110 vs 61)
[rosout][INFO] 2016-02-09 11:47:22,627: triggerbox_client: waiting for clockmodel estimate
[rosout][INFO] 2016-02-09 11:47:23,128: triggerbox_client: waiting for clockmodel estimate
[rosout][INFO] 2016-02-09 11:47:23,628: triggerbox_client: got clockmodel estimate
[rosout][INFO] 2016-02-09 11:47:23,629: Basler_21425990 (re)synchronized
[rosout][INFO] 2016-02-09 11:47:23,629: Basler_21425985 (re)synchronized
[rosout][INFO] 2016-02-09 11:47:23,629: Basler_21425975 (re)synchronized
[rosout][INFO] 2016-02-09 11:47:23,629: Basler_21418139 (re)synchronized
[rosout][INFO] 2016-02-09 11:47:23,630: Basler_21425980 (re)synchronized
[rosout][WARNING] 2016-02-09 11:48:23,213: unknown Exception receiving timestamp echo data: timed out
[rospy.core][INFO] 2016-02-09 11:48:23,653: signal_shutdown [coordinate processor thread died]
[rospy.topics][ERROR] 2016-02-09 11:48:23,664: Traceback (most recent call last):
File "/opt/ros/ros.electric.boost1.46/ros_comm/clients/rospy/src/rospy/topics.py", line 245, in close
c.close()
File "/opt/ros/ros.electric.boost1.46/ros_comm/clients/rospy/src/rospy/impl/tcpros_base.py", line 761, in close
self.socket.close()
AttributeError: 'NoneType' object has no attribute 'close'
[rospy.topics][ERROR] 2016-02-09 11:48:23,664: Traceback (most recent call last):
File "/opt/ros/ros.electric.boost1.46/ros_comm/clients/rospy/src/rospy/topics.py", line 245, in close
c.close()
File "/opt/ros/ros.electric.boost1.46/ros_comm/clients/rospy/src/rospy/impl/tcpros_base.py", line 761, in close
self.socket.close()
AttributeError: 'NoneType' object has no attribute 'close'
[rospy.impl.masterslave][INFO] 2016-02-09 11:48:23,664: coordinate processor thread died
[rosout][INFO] 2016-02-09 11:48:24,279: mainbrain finished
[rosout][WARNING] 2016-02-09 11:48:26,779: cameras failed to quit cleanly: ['Basler_21425980', 'Basler_21425975', 'Basler_21425990', 'Basler_21418139', 'Basler_21425985']
[rosout][WARNING] 2016-02-09 11:48:29,280: cameras failed to quit cleanly: ['Basler_21425980', 'Basler_21425975', 'Basler_21425990', 'Basler_21418139', 'Basler_21425985']
[rospy.core][INFO] 2016-02-09 11:48:29,280: signal_shutdown [atexit]
It receives some log messages, and then the coordprocessor thread dies.
At first I thought the logging code I added was responsible, but now that I see it revieves some log messages I am not sure. Thoughts?
@astraw: I ran rosmake -s ros_flydra, this alas didn't fix the problem.
@sdvillal can you chmod -R g+rx /mnt/strawscience/santi/strawlab/calibration-transport-not-open-logs
done, FWIW you can do all permission fixups via strawcore
root@strawcore:~# ./fix-isilon-perms.sh /mnt/strawscience/santi/strawlab/calibration-transport-not-open-logs
Yeah, since some time already permissions in strawscience are sometimes broken even when creating files with permissions properly configured.
I just pushed 478e6ab which stops the crashes.
etienne is still unable to detect points, but I think that is unrelated
@etiennecampione writes: "Hereafter is the error message I almost always get (let's say 95% of the time) while calibrating a Flycube. Can you indicate me what is going on? It's a really annoying and time-consuming problem".
How does the transport die? Is it possible that the logging service (and therefore the transport) is closed in one camera LogMessageThread thread while there are still messages to log in some other camera thread, and so we would need to add sync code? Or could it just be a network problem? Surely @astraw will know better about this.
I'm saving the full logs of that run here; they include a few other stacktraces.