-
Hi I am using this guide on [this](https://github.com/eduardolucioac/okd_bare_metal#create-the-workers-nodes-okd_worker_1-and-okd_worker_2) step worker node is staying on `GET https://api-int.mbr.doma…
-
### What happened?
Running [CIS benchmark](https://github.com/aquasecurity/kube-bench) results to failures:
```
[INFO] 4 Worker Node Security Configuration
[INFO] 4.1 Worker Node Configuration…
-
**Rancher Server Setup**
- Rancher version: 2.6-head commit id: `4206f57`
- Installation option (Docker install/Helm Chart): Helm
- If Helm Chart, Kubernetes Cluster and version (RKE1, RKE2, k3s…
-
/kind feature
**Describe the solution you'd like**
Currently, CAPI will spread control plane machines across the reported failure domains (i.e. availability zones). It doesn't do this for worker …
-
**Rancher Server Setup**
- Rancher version: 2.8.4 and 2.8.5
- Installation option (Docker install/Helm Chart): docker install
**Information about the Cluster**
- Kubernetes version: `v1.28.10+rk…
-
Hi,
We have multiple issues related to the cluster syncing status. I think all of them can be solved with a startup probe that marks the pod as running only after cluster_node_state is synced.
#…
-
### What happened + What you expected to happen
I'm serving ML models using Ray serve on Google Cloud.
**After upgrading Ray version from 2.6.3 to 2.7.1**, there is a bug when launching new worker…
-
Trying to connect to a invalid/non exisiting/offline tcp socket/server, causes to spwan worker, but do not exit.
This creates a memory leak.
Related/debugging information: #61
A watch on `watch…
-
### What happened + What you expected to happen
**1. Bug**
When running Ray on a Slurm Cluster it seems like Ray RLlib does not respect which nodes are specified as the head and worker nodes with th…
-
Have spoken to @yankcrime last week about adding support for another Cluster API Provider and after some additional thought and research have decided against CAPH for now and am interested in looking …