-
## Is your feature request related to a problem? Please describe (👍 if you like this request)
Feature request proposed by @YvanGuidoin. See https://github.com/longhorn/longhorn-manager/pull/3034#i…
-
Janus should emit a histogram metric tracking the size of each aggregation job created in reports.
-
**Describe the bug**
Jobs from real QPUs have a `metrics` method that allows for grabbing timing information. However, this is not the case when passing a simulator, where one gets the error:
```pyth…
-
If a report job fails, the failure is marked as an "unsuccessful" task outcome by Task Manager metrics. However, there are certain cases where a failure can be due to user configuration and would be e…
-
**What would you like to be added**:
We would like to propose a new feature in Kueue that enables dynamic scaling of job parallelism and resource allocation (CPU, RAM, and pods) based on job backlo…
-
We have a simple emergency metrics system that writes to a local SQLite db.
We want a metrics system that:
- has more robust collection than the current CLI output parsing
- ships metrics up to job-server…
-
**Is your feature request related to a problem? Please describe**.
Regular ACAs have a good set of built-in metrics: replica count, CPU and memory utilization.
I can't find anything like that for …
-
Not in any particular order:
**Define data, metrics we want to capture**
- Stage in, stage out, processing
- CPU, Memory usage, Network IO
- Job Inputs (stac + params)
- Job Outputs
- CWL run logs
*…
-
hey @znerol, thank you for creating this helpful exporter :raised_hands:
i'd like to track and set up alerts for failed or absent backups, replications, and on high IO delay (the one that's displa…
-
[ ] I have checked the [documentation](https://docs.ragas.io/) and related resources and couldn't resolve my bug.
**Describe the bug**
I set up a definition to split the input Dataset into several…