kubernetes / perf-tests

Performance tests and benchmarks
Apache License 2.0
884 stars 520 forks source link

Check watch terminations from clusters loader tests #2054

Open wojtek-t opened 2 years ago

wojtek-t commented 2 years ago

One of the important metrics that may suggest overload of the control plane is the number of watches that are closed by kube-apiserver because they don't keep up (or watchcache itself is not keeping). We want to add a check to our tests that will be validating if this metrics is not too high.

Metrics to exercise:

The easiest way to do it is probably add to prometheus-based measurement, but @marseel to confirm.

marseel commented 2 years ago

I believe the easiest way is to add new prometheus query to GenericPrometheusQuery here: https://github.com/kubernetes/perf-tests/blob/1f00a7787f9f47bad251d4e791a0117ad6fe7e1d/clusterloader2/testing/load/modules/measurements.yaml#L122 instead of creating new measurement.

anshulinteg commented 2 years ago

/assign

k8s-triage-robot commented 2 years ago

The Kubernetes project currently lacks enough contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

You can:

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

marseel commented 2 years ago

/remove-lifecycle stale

bouaouda-achraf commented 1 year ago

/assign

wojtek-t commented 1 year ago

The only missing thing now is to enable the new measurement in our tests, right?

marseel commented 1 year ago

Enabling measurement and then based on results possibly adding alerting to it.