c3-clement closed this issue 2 months ago
Hello @adejanovski @rzvoncek
FYI, I'm about to submit a PR to address this issue.
Thanks @adejanovski. When can we expect a release?
@c3-clement Medusa 0.22.3 is out with this patch in it. The k8ssandra-operator release will come out, it seems, next week.
See k8ssandra-operator issue: https://github.com/k8ssandra/k8ssandra-operator/issues/1406
What happened?
I deployed a K8ssandraCluster with 96 replicas and medusa enabled, and one of the pods never passed its readiness probe.
I identified the faulty pod: it was failing its readiness probe because of the medusa container. The medusa gRPC server did not start because load_config() failed (see logs below). Since the gRPC server was not started, the readiness probe never succeeded.
The medusa container was "blocked" and did not attempt to restart the gRPC server. I restarted the pod manually by deleting it, and the medusa gRPC server then started successfully.
Did you expect to see something different?
I expect the pod to restart and end up in the CrashLoopBackOff state if an uncaught exception is raised by the medusa Python process, instead of blocking indefinitely.
I believe this behavior was introduced by the following change: https://github.com/thelastpickle/cassandra-medusa/pull/731
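To illustrate the expected behavior, here is a minimal sketch of a fail-fast entrypoint. It is not Medusa's actual code: `ConfigError`, `start_grpc_server`, `main`, `entrypoint`, the stub body of `load_config`, and the `example.ini` path are all hypothetical stand-ins. The only point is that a startup failure ends the process with a non-zero exit code, so Kubernetes can restart the container rather than leaving it blocked.

```python
import logging
import sys


class ConfigError(Exception):
    """Hypothetical error standing in for any configuration failure."""


def load_config(path):
    # Hypothetical stub: always fails, to simulate an invalid configuration.
    raise ConfigError(f"invalid configuration: {path}")


def start_grpc_server(config):
    # Hypothetical stub for starting the gRPC server; never reached here.
    raise NotImplementedError


def main(config_path):
    """Return a process exit code: 0 on success, 1 if startup fails."""
    try:
        config = load_config(config_path)
        start_grpc_server(config)
    except Exception:
        # Log the traceback, then report failure instead of hanging.
        logging.exception("startup failed, exiting so the container restarts")
        return 1
    return 0


def entrypoint(config_path):
    # Propagate the exit code to the container runtime.
    sys.exit(main(config_path))
```

With a pod `restartPolicy` of `Always`, a container whose process exits with a non-zero code is restarted by the kubelet with increasing back-off, which is what surfaces as CrashLoopBackOff.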
How to reproduce it (as minimally and precisely as possible):
Start the medusa container with an invalid configuration.
Environment
Medusa logs
┆Issue is synchronized with this Jira Story by Unito ┆Reviewer: Alexander Dejanovski ┆Fix Versions: 2024-10 ┆Issue Number: MED-97