skypilot-org / skypilot

SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
https://skypilot.readthedocs.io
Apache License 2.0
6.82k stars 513 forks source link

[k8s] Move setup and ray start to pod args to make them async #4389

Closed Michaelvll closed 3 hours ago

Michaelvll commented 1 day ago

Moved setup/ray start to the kubernetes pod args to make them async

TODO:

Tested (run the relevant ones):

Michaelvll commented 3 hours ago

Tested: