Closed by andrewsykim 6 days ago
@kevin85421
Great! Would you mind adding a README to briefly document the benchmark results we have? Thanks!
I updated the README with some references on how to understand the results
Btw, what's the configuration of KubeRay (e.g. CPUs? memory? reconcile-concurrency)? Thanks!
We ran KubeRay with a CPU request of 16 and a memory limit of 32GiB, with --reconcile-concurrency=5. https://github.com/ray-project/kuberay/pull/2228 was an important fix for the RayJob scalability tests.
In practice we didn't need that much. These are the CPU / memory graphs from the most recent run:
Why are these changes needed?
Per https://github.com/ray-project/kuberay/issues/2069, this adds the 1K and 5K RayCluster / RayJob test results.
Related issue number
https://github.com/ray-project/kuberay/issues/2069
Checks