Investigate multi-cluster setup

AbsaOSS / spot

Aggregate and analyze Spark history, export to elasticsearch, visualize and monitor with Kibana.

Apache License 2.0

5 stars 0 forks source link

investigate monitoring across multiple clusters (Spark History Server / Menas instances) Idea:

One crawler process per cluster
Each crawler writes to its own set of indexes in the same elasticsearch instance
Kibana index patterns should be able to cover multiple indexes
The collected data can be filtered by history_host attribute (raw, aggs, errors(?))

Open questions:

How much flexible are Kibana's index patters?
How would this interfere with rotating indexes? (For limiting the used storage space)
how such setups will work for regression process?

AbsaOSS / spot