o19s / haystackhack24

Hackday at Haystack US 24
0 stars 0 forks source link

Solr Overseer / Zookeeper single point of contention #1

Open cfournie opened 2 months ago

cfournie commented 2 months ago

Overseer can become the bottleneck in very large clusters; let's:

  1. Look at how Overseer works
  2. Define the problem
  3. Look at existing solution attempts to see where we could contribute
cfournie commented 2 months ago

Reasons why work can build up on the overseer:

morissm commented 2 months ago

If search/indexation contention on the overseer is the issue, separating overseer and data nodes might be a solution. https://solr.apache.org/guide/solr/latest/deployment-guide/node-roles.html

cfournie commented 2 months ago

Changes proposed to Solr include:

cfournie commented 2 months ago

How Overseer works:

cfournie commented 2 months ago

Mitigation of this by creating separate Overseer nodes using node roles would be interesting, but unfortunately right now if using the k8s solr-operator to manage your cluster it does not support cluster roles yet.