cortexproject / cortex-helm-chart

Helm chart for Cortex
Apache License 2.0
149 stars 164 forks source link

Disable livenessprobe by default #484

Closed friedrichg closed 4 months ago

friedrichg commented 1 year ago

I have run all cortex components for years without liveness probes. they are bad for store-gateways because it causes re-sharding if they are under pressure. Restarting store-gateways can take a long time to recover, same for others components. We should disable all of them by default and let users enable them as needed.

Similar to https://github.com/cortexproject/cortex-helm-chart/pull/263

friedrichg commented 8 months ago

Also bad for compactors as confirmed today

aep commented 4 months ago

they also seem to be designed wrong?

the ready endpoint for store-gateway returns

Some services are not Running:
Running: 3
Starting: 1

so store-gateway is being killed because some other service isnt ready, but the other service doesnt have a probe?

nschad commented 4 months ago

+1

We should just remove it.

@friedrichg

Should we just drop livenessProbe for store-gateway and compactor then?

friedrichg commented 4 months ago

@nschad yes, please

nschad commented 4 months ago

Sorry for the hold-up

PR is open #502

@friedrichg