opensearch-project / opensearch-spark

Spark Accelerator framework ; It enables secondary indices to remote data stores.
Apache License 2.0
22 stars 33 forks source link

[FEATURE] Add metrics in spark #117

Closed penghuo closed 8 months ago

penghuo commented 1 year ago

Requirements

Tasks

Metrics

Dependency Services

Interactive Job, dimensions: [clientId, domainName, instance]

repl status

Streaming Job, dimensions: [clientId, domainName, instance, type]

Query Optimizer

noCharger commented 11 months ago
  1. Developed a high-level sequence diagram following offline discussions with @penghuo.

SeqDiagram

  1. To progress, we need to clarify which components are currently in place and identify those requiring development. Key tasks include:
penghuo commented 11 months ago

limitation: Per account limitation is 300tps

noCharger commented 10 months ago

[Summary] Flint Metrics Framework

Approach one: Codahale/Dropwizard aggregated metrics

Pros:

Cons:

Approach two: CloudWatch Agent Integration

Pros:

Cons:

Approach three: Leverage Spark event logs

Spark's event logs, a comprehensive record of events during a Spark application's execution, can indeed be utilized for parsing metrics, although this method differs from the direct use of the Dropwizard Metrics Library. These logs, typically in JSON format, include detailed information about various Spark activities and can be accessed and analyzed post-execution for insights into job performance and system behavior. While the Dropwizard Metrics Library offers real-time metrics for ongoing monitoring, Spark event logs are more suited for retrospective analysis, debugging, and performance audits. Extracting metrics from these logs involves parsing the JSON data to identify relevant metrics, a process that can be resource-intensive and is generally more complex than real-time monitoring. Therefore, while Spark event logs provide a valuable resource for detailed analysis after the fact, they serve a different purpose compared to the immediate insights offered by Dropwizard's real-time metrics.

Cost Analysis:

cc: @anirudha @penghuo @vamsi-amazon

noCharger commented 8 months ago

close as completed