ray-project / kuberay

A toolkit to run Ray applications on Kubernetes
Apache License 2.0
974 stars 330 forks source link

[Perf] Add a CPU-based training workload #2116

Closed kevin85421 closed 2 months ago

kevin85421 commented 2 months ago

Why are these changes needed?

Our scalability test requires some CPU-based workloads. This PR uses CPU to train a MNIST model. I also can run the RayJob successfully on my devbox.

Screenshot 2024-05-03 at 10 48 05 AM

Related issue number

Checks

kevin85421 commented 2 months ago

cc @andrewsykim

andrewsykim commented 2 months ago

LGTM! Tested this locally and confirmed it works. Thanks @kevin85421!