-
### What would you like to be added?
Adding the fullNameOverride support will solve the max character issue.
### Why is this needed?
There are long cluster names for the helm chart installation, ca…
-
### 1. Quick Debug Information
* Kubernetes Version: v1.28
* GPU Operator Version: v24.6.1
### 2. Issue description
The Kubernetes cluster has two worker nodes and each contains four A100 GPUs…
-
Hello maintainters!
In [the release note of 24.08](https://docs.nvidia.com/deeplearning/triton-inference-server/release-notes/rel-24-08.html#rel-24-08), there is a known issue which is
> Triton met…
-
### Please confirm the following
- [X] I agree to follow this project's [code of conduct](https://docs.ansible.com/ansible/latest/community/code_of_conduct.html).
- [X] I have checked the [current is…
-
**Output of the info page (if this is a bug)**
```
Getting the status from the agent.
==============
Agent (v6.6.0)
==============
Status date: 2018-11-15 10:48:23.952742 UTC
Pid: 3…
-
/kind bug
**1. What `kops` version are you running? The command `kops version`, will display
this information.**
1.28.4
**2. What Kubernetes version are you running? `kubectl version` will pr…
-
At the moment, we detect the difference between CronJobs and Jobs by the presence of timestamp at the end of the Job name. We keep track of these failures separately, and then we increment the `kubern…
-
**What happened**:
After noticing some "anomaliles" in some metrics, did a little Googling and checked the kube-state-metrics pod log. Found numerous messages like the following:
```
E1205 05…
-
### Component(s)
receiver/prometheus
### What happened?
## Description
I want to use `scrape_config_files` to add prometheus job. However, I found that the prometheus job defined in this way are n…
-
# Alert KubeClientCertificateExpiration firing in kube-system namespace
This is an automated issue created by the monitoring system. Please do not edit this message.
Alertmanager URL: https://alertm…