Closed jkotas closed 5 days ago
@EgorBo It looks like the GC reporting is messed up around the new bulk write barrier helper. Could you please take a look?
So far, I have seen it on native AOT linux-arm only. We seem to have a higher number of intermittent crashes than usual at the moment, with multiple different root causes, so it is not easy to tell whether this specific crash is hitting linux-arm only.
@SingleAccretion made an interesting guess that it might be related to https://github.com/dotnet/runtime/issues/99410#issuecomment-2034385058 (hard to tell from the asm you attached whether it's the tailcall arg setup region or not)
Ah, very unlikely here. I don't have an arm32 device to test on, but on 64-bit we don't emit any tail calls in that function, so it seems unlikely.
Seems like it's not failing anymore; very likely fixed by https://github.com/dotnet/runtime/pull/103301, which took such helpers out of nogc blocks, and potentially by https://github.com/dotnet/runtime/pull/102580
Crash dumps:
https://dev.azure.com/dnceng-public/public/_build/results?buildId=666172&view=ms.vss-test-web.build-test-results-tab&runId=16523620&paneView=dotnet-dnceng.dnceng-build-release-tasks.helix-test-information-tab&resultId=141696
https://dev.azure.com/dnceng-public/public/_build/results?buildId=666172&view=ms.vss-test-web.build-test-results-tab&runId=16523620&paneView=dotnet-dnceng.dnceng-build-release-tasks.helix-test-information-tab&resultId=141697
Both of these are crashes while enumerating GC roots:
The stack trace of the target thread:
Target method: