-
Hi
While using your exporter and the SLURM grafana dashboard, I noticed that those metrics are not exposed:
```
"expr": "slurm_account_cpus_running"
"expr": "slurm_account_jobs_pending"
"expr":…
-
**Agent Environment**
- Datadog Agent Version: `gcr.io/datadoghq/agent:7.54.0-jmx`
- Relevant configuration: Using the Datadog Agent in a Kubernetes pod.
**Describe what happened:**
The Datadog …
-
### Is your feature request related to a problem? Please describe
vmstorage lacks on reporting information about resource usage. Some info like cache size, number of concurrent requests, disk IO, C…
-
**Please provide an in-depth description of the question you have**:我是使用kubesphere安装的prometheus,我应该如何添加GPU的监控
**What do you think about this question?**:
**Environment**:
- HAMi version:
- Kuber…
-
Required k8s persistent volume & filesystem level metrics along with their grafana dashboards and few sane alerts preferably in [kube-prometheus mixin](https://github.com/kubernetes-monitoring/kuberne…
-
### What is the version?
3.3.5-3.4.1
### What happened?
Metrics like `DCGM_FI_PROF_GR_ENGINE_ACTIVE` are only exposed for one single pod even though there are multiple pods that use the same GPU
#…
-
### Component(s)
collector, target allocator
### Is your feature request related to a problem? Please describe.
I would like to use a ServiceMonitor to scrape metrics from all of the kubelets…
-
## Context
We would like to monitor the number of drifting layers, with a metric monitoring tool e.g. Prometheus.
## Feature requested
- exposed /metrics endpoint readable by prometheus
- `burr…
-
Just a thought, it would be really nice that if the worker's sent health metrics to Orchard and this was exposed via the /health endpoint.
Things like CPU, Disk etc. Almost like a prometheus node_…
-
The internal Metrics exposed by clustered VictoriaMetrics are not well documented. There is nowhere you can find a description as to what an individual metric means. Some metrics are confusingly named…