Open jgsogo opened 4 months ago
Just to add, our understanding of the execution environment is:
I finally found that adding the following .bazelrc worked:
startup --host_jvm_args=-XX:-UseContainerSupport
Obviously it may be necessary to explicitly set memory, cpu to get the best performance — but at least there’s no exception now and bazel runs.
There's probably still a jvm issue - so please let me know if there's other things you'd like me to try to capture more info Additionally I think the system config of this machine is a bit unique -- something we'll look at in our environment
Looks like https://bugs.openjdk.org/browse/JDK-8286212, which is still open.
Description of the bug:
Bazel server crashes at the very beginning (stack trace starts at
CollectLocalResourceUsage
).The bug happens in a CI:
icr.io/continuous-delivery/pipeline/pipeline-base-ubi:3.40
root
openjdk version "17.0.10" 2024-01-16 IBM Semeru Runtime Open Edition 17.0.10.0 (build 17.0.10+7)
Locally (MacOS) everything works perfectly. It looks like it's related to the container runtime (related?).
Note.-
--noexperimental_collect_resource_estimation
but it looks like this flag has no effect (I don't really know if it should skip thisCollectLocalResourceUsage
step).Which category does this issue belong to?
Core
What's the simplest, easiest way to reproduce this bug? Please provide a minimal example if possible.
To reproduce the bug a simple
bazel build //...
is enough... however, it only fails in some environment under some circunstances (probably depends on the container runtime), so I think it's not easy to reproduce.Which operating system are you running Bazel on?
No response
What is the output of
bazel info release
?It also fails
If
bazel info release
returnsdevelopment version
or(@non-git)
, tell us how you built Bazel.No response
What's the output of
git remote get-url origin; git rev-parse HEAD
?No response
If this is a regression, please try to identify the Bazel commit where the bug was introduced with bazelisk --bisect.
No response
Have you found anything relevant by searching the web?
Any other information, logs, or outputs that you want to share?
Is there any CLI flag I can use to "bypass" this resource collection (and avoid the issue)? Thanks!