Closed osrf-migration closed 11 years ago
Original comment by John Hsu (Bitbucket: hsu, GitHub: hsu).
I just tried running default branches of gazebo and drcsim in debug mode, but am unable to reproduce the crash.
It looks like a race condition in tmpPlaneBuffer array, where tmpPlaneBuffer[i + 1]
is not accessible (with address of 0x110
).
Can you do a "thread apply all bt" and see if there are multiple threads trying to access heightfield.cpp
?
Thanks, John
Original comment by John Hoare (Bitbucket: jhoare).
I did a backtrace across multiple threads, but I only see references to heightfield.cpp in the crashing thread (thread 1)
Thread 26 is in a malloc() call during the crash though, all the other threads look to be blocked or sleeping.
Stacktrace: http://pastebin.com/XJ6MBVKz
Edit: I am running tagged gazebo_1.3.1 and drcsim_1.3.1 I will try using the default branches now and verify the bug is still occuring.
It does seem strange that you can not reproduce the crash. We are able to replicate this bug on several different models of computers, however all are core i7 or xeon machines.
Original comment by John Hoare (Bitbucket: jhoare).
Okay, I did just ran the default branches of both gazebo & drcsim, and I am no longer seeing the crash occur. Or at least I ran the robot around for about 5x longer than I would have been able to before, without any crashes.
Original comment by John Hoare (Bitbucket: jhoare).
Lowered priority since using trunk version appears to mitigate bug.
Original comment by Nate Koenig (Bitbucket: Nathan Koenig).
I can't reproduce this problem. Is it safe to mark this as resolved?
Original comment by John Hoare (Bitbucket: jhoare).
Have you tried reproducing it with gazebo_1.3.1 and drcsim_1.3.1?
My concern with closing it is that there hasn't been a specific fix for it, so the bug is potentially still there. As John implied, this is likely caused by a race condition. The race condition may just be much less likely to occur in the newer (default) branches. I have had gazebo/drcsim crash on me with the newer branches, but with much less frequency and never with a resulting core dump or debugger attached to it.
Original comment by Nate Koenig (Bitbucket: Nathan Koenig).
Until after June, 2013.
Original comment by John Hoare (Bitbucket: jhoare).
I'll just close this out because I believe the issue still exists on the reported version, but I have (obviously) not been able to reproduce on newer versions.
Original report (archived issue) by John Hoare (Bitbucket: jhoare).
This was originally asked on answers.ros.org [1], but upon more inspection I'm convinced this is a bug.
Using the DRCSim Robot (I've tried versions 1.1,1.2, and 1.3, built both from source and the packaged versions) while using the "fake walking" and subscribing to the laser scan from the head, gazebo will crash.
Steps for re-creating the bug:
Here is a stack trace of what I get when gzserver crashes: http://pastebin.com/LYv3tt8t
I've filed this as a bug against gazebo rather than DRCSim because of the contents of the stack trace.
[1] http://answers.gazebosim.org/question/591/drcsim-1213-with-gazebo-crashes-often-when-moving/