-
Checklist:
* [ ] I've searched in the docs and FAQ for my answer: https://bit.ly/argocd-faq.
* [ ] I've included steps to reproduce the bug.
* [ ] I've pasted the output of `argocd version`.
…
-
### 1. Issue or feature description
I use a GPU pod to run PyTorch processes with the device plugin, and occasionally hit a problem that shows "CUDA unknown error". But after I killed the nvidia-dev…
chxk updated
9 months ago
-
At the moment, Kepler scrapes metrics from all namespaces by default. However, I want it to scrape metrics only from the specific namespace where Kepler is deployed.
I did convert the DaemonSet to a …
-
The "csi-node-driver" daemonset uses the default service account in Calico v3.24.x.
## Expected Behavior
csi-node-driver daemonset should use a custom service account.
## Current Behavi…
-
### Preflight Checklist
- [X] I agree to follow the [Code of Conduct](https://github.com/deckhouse/deckhouse/blob/main/CODE_OF_CONDUCT.md) that this project adheres to.
- [X] I have searched the [iss…
-
### What's wrong?
- I have clustering enabled and am currently collecting only pod metrics for simplicity, but metrics are still being collected twice, leading to out-of-order errors in my alloy …
-
I set up CAA on Azure following the website instructions. The nginx deployment worked fine for some time, but something makes the pod crash and restart, and eventually it gets stuck with `Containe…
mythi updated
3 months ago
-
I'm running OpenShift 4.10.16 with version 1.11.1 of the NVIDIA GPU Operator (certified by Red Hat), and I'm trying to use GPU passthrough for VMs on one of my nodes (the node is bare metal).
…
-
### Describe the bug
For GCP deployments that want to use A100 (or similar) GPUs, there is no way of making sure the nvidia drivers are installed ([via this daemonset](https://github.com/nebari-dev/n…