-
### Search before asking
- [X] I have searched the [issues](https://github.com/ray-project/kuberay/issues) and found no similar feature request.
### Description
See discussion in https://ray…
-
### Enhancement Description
- One-line enhancement description (can be used as a release note): Configurable tolerance for Horizontal Pod Autoscalers
- Kubernetes Enhancement Proposal: https://git…
-
### What happened + What you expected to happen
I have found that the Ray autoscaler sometimes mistakenly kills nodes that are still working. My scenario is that 400 Ray tasks are submitted at the same ti…
-
**Which component are you using?**:
Horizontal workload autoscaler.
**What version of the component are you using?**:
Not relevant.
**What k8s version are you using (`kubectl version`)?**:…
-
**Which component are you using?**:
cluster autoscaler
**Is your feature request designed to solve a problem? If so describe the problem this feature should solve.**:
We want to add IBM I…
-
### Description
## Situation
We're running an ML server with Ray Serve on Google Cloud. Currently, the zone I'm using on Google Cloud is suffering from an "out of GPUs" issue, and at those times we cannot scale up sin…
-
**Which component are you using?**:
cluster-autoscaler
**Is your feature request designed to solve a problem? If so describe the problem this feature should solve.**:
Cluster-autoscaler…
-
### What happened + What you expected to happen
I have deployed an autoscaling cluster using KubeRay. It looks like our cluster sometimes fails to connect to the Kubernetes API server. W…
-
### Preflight Checklist
- [X] I agree to follow the [Code of Conduct](https://github.com/deckhouse/deckhouse/blob/main/CODE_OF_CONDUCT.md) that this project adheres to.
- [X] I have searched the […
-
### What happened + What you expected to happen
I am using the Ray autoscaler with 16 workers, each with 1 NPU. The Ray head reported an error after the autoscaler had been running for 23 hours; the logs are as follows:
The autoscaler…