JSGette closed this 11 months ago
Hey! Make sure you have the following things set up:
- You are not passing `-g` to `tini`. Otherwise `tini` will end up sending signals to your build actions, which it shouldn't. Only `bb_runner` should do that. https://github.com/buildbarn/bb-deployments/commit/e000c76591c122f12c464ee59eb29eb3669311ec
- `terminationGracePeriodSeconds` on your worker pods is sufficiently high. It should be at least as high as `maximumExecutionTimeout`. That way Kubernetes will not forcefully terminate your worker pods while actions are still running.
- `stopGracefully` is enabled on the gRPC server on the `bb_runner` side. That way `bb_runner` will not shut down prematurely. https://github.com/buildbarn/bb-storage/blob/e29de79751922fd901a9bfe0015c865eb8dcf745/pkg/proto/configuration/grpc/grpc.proto#L216-L237
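To sanity-check the grace period rule before rolling out a change, something like the sketch below might help. It is not part of Buildbarn or Kubernetes; the helper names are made up for illustration, and it assumes `maximumExecutionTimeout` is written as a seconds string such as `3600s` in your config.

```python
import re


def duration_to_seconds(duration: str) -> float:
    """Parse a seconds-style duration string such as '3600s' or '0.5s'.

    Assumption: the timeout in your config uses this format.
    """
    match = re.fullmatch(r"(\d+(?:\.\d+)?)s", duration.strip())
    if match is None:
        raise ValueError(f"unsupported duration format: {duration!r}")
    return float(match.group(1))


def grace_period_is_sufficient(
    termination_grace_period_seconds: int,
    maximum_execution_timeout: str,
) -> bool:
    """Return True if the pod's grace period covers the longest allowed action."""
    return termination_grace_period_seconds >= duration_to_seconds(
        maximum_execution_timeout
    )


if __name__ == "__main__":
    # Kubernetes defaults terminationGracePeriodSeconds to 30, which is
    # too short for a one-hour execution timeout.
    print(grace_period_is_sufficient(30, "3600s"))
    print(grace_period_is_sufficient(3600, "3600s"))
```

If the check fails, Kubernetes can kill the pod with SIGKILL while an action is still running, which is the situation the second bullet above is meant to avoid.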
We've deployed Buildbarn in a Kubernetes cluster and want to automatically scale remote workers up and down. If I use a simple command to downscale the deployment that manages the remote workers, Bazel reports an error and doesn't retry anymore.
Example of downscale command:
Error reported by bazel:
`--remote_retries` is set to 5
bazel version: 6.3.2
buildbarn versions: scheduler: 20230808T060019Z-22e8ab3, worker: 20230308T094934Z-44790d8