Closed mjsobrep closed 4 years ago
Uhh, that looks like a memory bug, since we get EFAULT ("Bad address") from execvp()... Could you please specify which ROS + rosmon version you are using? Repository packages or built from source?
I recall that we fixed a nasty bug in #103, maybe this is related.
This is on Kinetic coming from repository:
Package: ros-kinetic-rosmon Status: install ok installed Priority: extra Section: misc Installed-Size: 29 Maintainer: Max Schwarz <max.schwarz@uni-bonn.de> Architecture: amd64 Version: 2.1.1-1xenial-20200229-053707+0000 Depends: ros-kinetic-rosmon-core, ros-kinetic-rqt-rosmon
A bit more intuition: It seems like all of the nodes that are having trouble are either capturing from cameras or in the same namespace as a camera capturing node. There are other nodes using USB devices that seem ok. This system is often shut down messily (the os is shutdown without regard for what is running).
Looks the same as #103 which was fixed in v2.2.1. On Kinetic, 2.1.1-1 is the released version. Any chance we can get a bump in released version or should I just build from source?
Yes, sure. I always delay the Kinetic release a bit to test things on Melodic first, but this time I simply forgot to do the Kinetic update.
I triggered a release right now. The rosdistro PR is here: https://github.com/ros/rosdistro/pull/24555
Awesome, thanks
The packages have built, but are not pushed into the main repositories yet. If you like & have time, you could already test them:
The package sync is out => rosmon 2.2.1 including the fix for this issue can be installed from the repositories. I'll close this now, feel free to reopen if the issue still persists :)
I have this occasional problem working with rosmon (which is otherwise incredible, thank you) where on launch ~half of my nodes crash. It seems to be a problem from an interaction with rosmon. Sometimes restarting the system solves it, sometimes it does not. I think that rosmon is building directories to keep track of something and not properly cleaning them up / not tolerating messy shutdowns? Any thoughts on how to fix this?
Snippet of the print out after a failed launch: