elastic / apm

Elastic Application Performance Monitoring - resources and general issue tracking for Elastic APM.
https://www.elastic.co/apm
Apache License 2.0

[APM] Service instance runtime metrics #301

Open alex-fedotyev opened 4 years ago

alex-fedotyev commented 4 years ago

Summary of the problem (If there are multiple problems or use cases, prioritize them)

Currently APM agents collect various system and runtime metrics, which could help detect resource saturation or configuration issues. Visualizing these metrics for every agent type would make this information actionable during performance issues troubleshooting.

User stories

List known (technical) restrictions and requirements

Has to work with different agent types and appreciate that each runtime has its own specific runtime metrics.

If in doubt, don’t hesitate to reach out to the #observability-design Slack channel.

elasticmachine commented 4 years ago

Pinging @elastic/observability-design (design)

sorenlouv commented 4 years ago

We have three issues for runtime metrics:

Are all of them needed? I'm not sure what the purpose of the meta issue is.

sorenlouv commented 4 years ago

Visualizing these metrics for every agent type would make this information actionable during performance issues troubleshooting.

What are "these metrics"? Currently we show CPU and memory metrics for each agent (except the Java agent).

Do we want to keep showing metrics as averages across all hosts / VMs / containers, or are we going to show them per container like we do for Java?