apache / solr-operator

Official Kubernetes operator for Apache Solr
https://solr.apache.org/operator
Apache License 2.0
242 stars 112 forks source link

Liveness probe failing for Prometheus Exporter connected to a large SolrCloud #693

Closed HoustonPutman closed 3 months ago

HoustonPutman commented 3 months ago

When running a Prometheus Exporter that is connected to a SolrCloud with many nodes/pods, the livenessProbe can fail with a EOF exception no matter what resources the pods are given.

I believe this is due to the kubelet not being able to handle a large amount of data being sent back in a probe response.

In order to fix this we should request back no metrics in the liveness probe. That way the response is empty.