AbsaOSS / spot

Aggregate and analyze Spark history, export to elasticsearch, visualize and monitor with Kibana.
Apache License 2.0
5 stars 0 forks source link

Investigate multi-cluster setup #39

Closed DzMakatun closed 3 years ago

DzMakatun commented 3 years ago

investigate monitoring across multiple clusters (Spark History Server / Menas instances) Idea:

Open questions:

DzMakatun commented 3 years ago

Solved within pr #44 It is possible to monitor multiple clusters (each with its own Spark History server) with Spot. For this scenario a separate Spot crawler process needs to be running for each Spark History server (and optionally Menas). Each process writes to its own set of indexes within the same elasticsearch instance. If the index names follow the defined pattern (spot_\<raw/agg/err>_\<clustername>\\<id>) the data can be visualized in Kibana using the setup provided in spot/kibana/. There the data can be filtered by history_host.keyword if required.