Open monwolf opened 2 years ago
@mpirvu : could you have someone triage this problem please?
@gacholio does OpenJ9 support asynchronous sampling at the moment? I know you did some work in this area, but I don't know the outcome. Thanks
Yes, this has been implemented. This crash is not in the async sampler, which implies either that it is not a problem with AsyncGetCallTrace at all, or that the async call is corrupting the stack and causing a later crash.
Another consumer of ASGCT had to modify their signal handler to avoid a large on-stack buffer for the returned frames.
Some details may be found in #13838
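To illustrate the on-stack buffer point above, here is a minimal sketch in C of a sampling signal handler that writes frames into a buffer allocated outside the handler instead of a large local array. The names `call_frame`, `call_trace`, `stub_get_call_trace`, `install_sampler`, and `take_one_sample` are hypothetical; the stub only stands in for the VM's stack walker so the example runs without a JVM. It is not the change the other ASGCT consumer actually made, only the general shape of the idea.

```c
#ifndef _POSIX_C_SOURCE
#define _POSIX_C_SOURCE 200809L
#endif
#include <signal.h>
#include <stddef.h>
#include <string.h>

#define MAX_FRAMES 2048

/* Hypothetical stand-ins for the frame and trace records a stack walker fills in. */
typedef struct { int lineno; void *method_id; } call_frame;
typedef struct { void *env_id; int num_frames; call_frame *frames; } call_trace;

/* Allocated once, outside the handler. A local call_frame[MAX_FRAMES] array
 * inside the handler would put tens of kilobytes on whatever stack the signal
 * happens to arrive on. A real profiler would need one buffer per thread;
 * a single shared buffer is an assumption made to keep the sketch short. */
static call_frame frame_buf[MAX_FRAMES];
static volatile sig_atomic_t last_num_frames = -1;

/* Hypothetical stub standing in for the VM's async stack walker.
 * It pretends the interrupted thread had at most three frames. */
static void stub_get_call_trace(call_trace *trace, int depth, void *ucontext) {
    (void)ucontext;
    int n = depth < 3 ? depth : 3;
    for (int i = 0; i < n; i++) {
        trace->frames[i].lineno = i + 1;
        trace->frames[i].method_id = NULL;
    }
    trace->num_frames = n;
}

/* Signal handler: only a small trace header lives on the stack. */
static void sample_handler(int signo, siginfo_t *info, void *ucontext) {
    (void)signo;
    (void)info;
    call_trace trace;
    trace.env_id = NULL;
    trace.num_frames = 0;
    trace.frames = frame_buf; /* preallocated storage, not a local array */
    stub_get_call_trace(&trace, MAX_FRAMES, ucontext);
    last_num_frames = trace.num_frames;
}

/* Installs the handler for SIGPROF; returns 0 on success. */
int install_sampler(void) {
    struct sigaction sa;
    memset(&sa, 0, sizeof sa);
    sa.sa_sigaction = sample_handler;
    sigemptyset(&sa.sa_mask);
    sa.sa_flags = SA_SIGINFO | SA_RESTART;
    return sigaction(SIGPROF, &sa, NULL);
}

/* Delivers one SIGPROF to the calling thread and reports the frame count. */
int take_one_sample(void) {
    raise(SIGPROF);
    return (int)last_num_frames;
}
```

Calling `install_sampler()` once and then `take_one_sample()` runs the handler synchronously on the calling thread, which is enough to check that the handler fills the preallocated buffer.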
@monwolf Could you please help us reproduce this issue? What would be the simplest setup that shows this bug? Personally, I have no experience with the Datadog profiler. Thanks
I haven't had any luck reproducing it with the petclinic Spring Boot app. Tomorrow I'll try again.
@monwolf any news regarding this issue? We are facing the same failure. We upgraded to Eclipse OpenJ9 VM 11.0.17.0 (build openj9-0.35.0, JRE 11 Linux amd64-64-Bit Compressed References 20221031_559 (JIT enabled, AOT enabled)), but it didn't help.
I also confirmed that after disabling the Datadog agent, everything runs well.
Java -version output
Output from java -version.
Summary of problem
When enabling the async profiler with the Datadog module, we start seeing crashes in the service (see https://github.com/DataDog/dd-trace-java/issues/3616). I don't know whether this is related to the Datadog agent implementation or to the JIT.
Stderr:
Because we are running this in a volatile environment, we do not have access to the core dumps or crash files.