issues
ray-project/kuberay
A toolkit to run Ray applications on Kubernetes
Apache License 2.0
963 stars
328 forks
added autoscaling support to Python APIs
#2159
blublinsky
closed
1 month ago
3
[Bug] Readiness probe failed: timeout on minikube
#2158
anovv
opened
1 month ago
7
[Feature] Should we also set PublishNotReadyAddresses if the service is not headless?
#2157
rueian
opened
1 month ago
3
[perf-tests] make the bucket name and prefix configurable for ray data image resize job
#2156
andrewsykim
closed
1 month ago
0
[Feature] Checkpoint API to recover from checkpoint from previous runs
#2155
sathyanarays
closed
3 weeks ago
2
[Bug] RayJob falsely marked as "Running" when driver fails
#2154
sathyanarays
opened
1 month ago
3
FT GCS should handle draining of node where head pod is scheduled
#2153
abatilo
opened
1 month ago
3
[Chore] Use new golangci-lint rules only for ray-operator
#2152
MortalHappiness
closed
1 month ago
0
[Bug] "unable to find head service" error when specifying app.kubernetes.io/name on headGroupSpec
#2151
jonapgar-groupby
closed
1 month ago
3
[RayCluster][Fix] Add expectations of RayCluster
#2150
Eikykun
opened
1 month ago
13
[perf-test] update 100 RayJob perf tests to use PyTorch trainer and Ray Data examples
#2149
andrewsykim
closed
1 month ago
1
Improve the logs before creating the ray cluster
#2148
oksanabaza
closed
1 week ago
0
[Bug] RayJob does not work when `app.kubernetes.io/name` is set
#2147
kwohlfahrt
closed
1 month ago
3
[Feature] Why RayJob Spec can't set EndpointMemory?
#2146
meibenjin
closed
1 month ago
2
[Docs][Development] Delete linting docs
#2145
MortalHappiness
closed
1 month ago
1
[Style] Fix golangci-lint rule: govet
#2144
MortalHappiness
closed
6 days ago
1
[Style] Fix golangci-lint rule: unconvert
#2143
MortalHappiness
closed
1 month ago
1
[Style] Fix golangci-lint rule: noctx
#2142
MortalHappiness
closed
1 month ago
0
[Style] Fix golangci-lint rule: errorlint
#2141
MortalHappiness
closed
1 month ago
3
[Fix][precommit] Fix pre-commit golangci-lint always succeed
#2140
MortalHappiness
closed
1 month ago
0
[Chore] Turn off no-commit-to-branch rule
#2139
MortalHappiness
closed
1 month ago
0
[4/N][Chore] Turn off golangci-lint rules except ray-operator
#2138
MortalHappiness
closed
1 month ago
0
[Feature] RayService CRD to have ImagePullSecret Reference
#2137
roverkinz
opened
1 month ago
0
[Feature] RayCluster Helm Chart: Add pod level securityContext in addition to container level securityContext
#2136
arueth
closed
1 month ago
3
[Perf] Add a CPU-based image resizing workload using Ray Data
#2135
kevin85421
closed
1 month ago
0
Add test for configurable k8s job backoff limit
#2134
jjyao
closed
1 month ago
0
[5/N][Refactor] Run golangci-lint for all files (only autofix rules)
#2133
MortalHappiness
closed
1 month ago
0
[2/N][Refactor] Run golangci-lint for apiserver
#2132
MortalHappiness
closed
1 month ago
0
[1/N][Refactor] Run golangci-lint for ray-operator
#2131
MortalHappiness
closed
1 month ago
0
[2/N][Refactor] Run pre-commit for all files (without golangci-lint)
#2130
MortalHappiness
closed
1 month ago
1
[3/N][CI] Replace lint CI with pre-commit
#2129
MortalHappiness
closed
1 month ago
2
[N/N][Chore] Add golangci-lint rules
#2128
MortalHappiness
closed
1 month ago
0
[1/N][Chore] Add pre-commit hooks
#2127
MortalHappiness
closed
1 month ago
1
[Perf] Add NUM_WORKERS and CPUS_PER_WORKER env to the mnist workload
#2126
rueian
closed
1 month ago
0
[Bug] [raycluster-controller] Kuberay cannot recreate new raycluster header pod when it has been evicted by kubelet as disk pressure
#2125
xjhust
opened
1 month ago
25
[Refactor] Follow-up for PR 1930
#2124
MortalHappiness
closed
1 month ago
1
[Cherry-pick][Hotfix][CI] Pin setup-envtest dep (#2038)
#2123
kevin85421
closed
1 month ago
0
[Telemetry] Update KUBERAY_VERSION
#2122
kevin85421
closed
1 month ago
0
[Cherry-pick][CI] Pin kustomize to v5.3.0 (#2067)
#2121
kevin85421
closed
1 month ago
0
[Cherry-pick][Bug] All worker Pods are deleted if using KubeRay v1.0.0 CRD with KubeRay operator v1.1.0 image (#2087)
#2120
kevin85421
closed
1 month ago
0
[Cherry-pick][Bug] Ray operator crashes when specifying RayCluster with resources.limits but no resources.requests (#2077)
#2119
kevin85421
closed
1 month ago
0
[CI] Remove unnecessary sample YAML symbolic links
#2118
kevin85421
closed
1 month ago
0
[Feat][RayCluster] Make the Head service headless
#2117
rueian
closed
1 month ago
11
[Perf] Add a CPU-based training workload
#2116
kevin85421
closed
1 month ago
2
Add HeadInfo.Ready status for RayCluster
#2115
Yicheng-Lu-llll
opened
1 month ago
0
[Feature] [API Server] [RFC] Add persistence for job history using a SQL database
#2114
han-steve
opened
1 month ago
0
[Chore] Delete redundant pod existence checking
#2113
MortalHappiness
closed
1 month ago
0
[Feature] Upgrade ray version to 2.20
#2112
can-anyscale
closed
1 month ago
1
[Test] Move StateTransitionTimes envtest to a better place
#2111
kevin85421
closed
1 month ago
1
[Perf] Improve perf-test YAMLs and README
#2110
kevin85421
closed
2 months ago
1