-
This issue is just a placeholder for the task of adding a Metrics-For-Job Bubble Chart.
I haven't submitted PRs in a little while, so I figured I could use this issue as a method of communication a…
-
### What happened + What you expected to happen
It will be great if we could export metrics of job submission, e.g.
- number of failed jobs
- number of success jobs
### Versions / Dependencies
m…
-
### What you would like to be added?
tf-job-operator v1.0 metrics can expose specific failed pods
The logging details are as follows
`content="{'filename':'record/event.go:221','level':'inf…
-
### Component(s)
exporter/googlemanagedprometheus
### Describe the issue you're reporting
I'm talking about [the following example][a].
> The order in each section below is the best practice. Re…
-
### Proposal
Hello everyone,
Would it be possible to add the sd_configs to the global option, which has the same behavior as evaluation_interval or other global variables?
Example:
global:
ev…
-
A UA cluster was observed to have identical metrics for both the number of scheduled backups that succeed in both the system and application tenant and the number that failed in each, despite the jobs…
-
**Describe the bug**
Local LLMs either raise Timeout error or Fails to parse output.
Ragas version: 0.1.15
Python version: 3.11.3
**Code to Reproduce**
```python
from transformers import Aut…
-
### Proposal
Add support for the `ui` block to be defined in `job.group` and `job.group.task`. Additionally, add support for alloc-specific ui links. Both could be achievable via some sort of url t…
-
Is there a limit of how many metrics that can be sent to datadog per workflow run? We have a pretty complicated workflow and I'm noticing that there are no metrics for certain jobs. Here's an example:…
-
Extension of #64933. Need other information about changefeeds to show users per changefeed, namely:
* Memory usage
* Buffer size
* Error count
* Records sent in the last hour
Will need to cha…