elastic / elasticsearch

Free and Open, Distributed, RESTful Search Engine
https://www.elastic.co/products/elasticsearch
Other
68.54k stars 24.35k forks source link

[Transform] Telemetry, Metrics, and Profiling #109555

Open prwhelan opened 3 weeks ago

prwhelan commented 3 weeks ago

Description

We should look into making Transform performance issues easier to debug using the existing tools supported in Kibana/Elasticsearch. The goal is that users can leverage the existing tools to investigate issues.

Look into:

Already have a preliminary metric for "overhead" and "cardinality" running locally, as well as forwarding the initial tracer when the Transform is started.

### Tasks
- [ ] [Transform] Create hook in AsyncTwoPhase for start/stop checkpoints
- [ ] [Transform] Additional metrics: last*, maxPageSearchSize used for the checkpoint
elasticsearchmachine commented 3 weeks ago

Pinging @elastic/ml-core (Team:ML)

prwhelan commented 3 weeks ago

Notes:

Decisions:

Questions:

  1. How much does this cost?
  2. Should we include errors initially? It doesn't look like too many people are using errors. If so, how do we correlate them to log messages?