apache / solr-operator

Official Kubernetes operator for Apache Solr
https://solr.apache.org/operator
Apache License 2.0
247 stars 112 forks source link

solr prometheus exporter crashloopbackoff #497

Open vipul-06 opened 1 year ago

vipul-06 commented 1 year ago

I have deployed solr prometheus exporter for monitoring purpose. It is running on gke but my exporter pod is gettting error of crashloopbackoff. The logs of my exporter pod are like this INFO - 2022-11-10 05:45:56.943; org.apache.solr.common.cloud.ConnectionManager; Waiting for client to connect to ZooKeeper INFO - 2022-11-10 05:45:56.991; org.apache.solr.common.cloud.ConnectionManager; zkClient has connected INFO - 2022-11-10 05:45:56.991; org.apache.solr.common.cloud.ConnectionManager; Client is connected to ZooKeeper INFO - 2022-11-10 05:45:57.003; org.apache.solr.common.cloud.ZkStateReader; Updated live nodes from ZooKeeper... (0) -> (5)

HoustonPutman commented 1 year ago

Can you share the yamls for your solrcloud and solrprometheusexporters? Hard to debug without that.

HoustonPutman commented 1 year ago

is that the same basic auth secret that the SolrCloud is setup to use? have you changed the solrcloud security after creating the cloud? The default roles provided by the Solr Operator should allow for calling the ping handler.

Can you check your solr cloud security json and see if the user in that basic auth secret is allowed to use the ping handler?

fliphess commented 1 year ago

I ran into a similar issue where the prometheus exporter kept crashing: Changing the CPU limits on the prometheus exporter did the trick for me: Otherwise every collection round it crashes because it's being throttled by the CPU.