Closed azdle closed 1 year ago
This is a bug in rr
and seems to have nothing to do with suave or ROS. Sorry for the noise.
#1 0x00007ff0147297d9 in _ZNSt13random_device7_M_initERKNSt7__cxx1112basic_stringIcSt11char_traitsIcESaIcEEE (this=0x7ffc53864000, token=0x7ffc53863fe0) at ./src/preload/overrides.c:233
is coming from rr: https://github.com/rr-debugger/rr/blob/5.5.0/src/preload/overrides.c#L233
Hi,
I work for a company called Auxon that makes trace-based testing and verification tools primarily for cyber-phyical systems. We're trying to put together a demonstration based on suave. However, we're seeing some segfaults that our ROS tooling seems to be making worse, but that seems to already exist without it. I'm wondering if you already know anything about these problems.
For reference, this is running suave straight from this repo without any of our tooling, commit beb4712, within docker, by running the command
./build_docker_images.sh && docker run -it --shm-size=512m -p 6901:6901 -e VNC_PW=password --security-opt seccomp=unconfined -v ~/suave_results:/home/kasm-user/suave/results suave:dev
and then runningrr ./example_run.sh
within the container. The only change that I've made is to installgdb
andrr
to be able to debug the system.When I do a run, it does seem to work as intended, the vehicle finds the pipe and follows it until the mission times-out, but I see what I believe is the ROS daemon itself has segfaulted:
When we add our instrumentation for ros topic publisher and subscriber tracing we are seeing much more variable segfaulting happening within the instrumentation in many different places/processes, but I haven't been able to nail down an exact cause for that yet and I'm currently assuming that this segfault may be related and since it is much more consistent I'm trying to diagnose this one first.
I'm going to continue trying to get more information out of this system about what is going on, but I'm pretty new to ROS so I just wanted to see if there's anything that stands out to you as obvious for what could be wrong.
Also, if you happen to know off hand:
rr
to not see them. I'm seeing segfault exit codes logged for some process, especially suave reasoner, but the processes aren't able to be captured byrr
.Those are things that are in my list of things that I'm going to try to figure out at some point that have stumped me in the brief attempts I've made so far.
Thanks for your time, I'd love to hear any thoughts you have on this.