skypilot-org / skypilot

SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
https://skypilot.readthedocs.io
Apache License 2.0
6.81k stars 513 forks source link

[WIP] Advanced DAG Workflow. #4319

Open cblmemo opened 1 week ago

cblmemo commented 1 week ago

A PR to stash our progress for advanced. This is in experimental stage and we need to discuss more on the API & UX issue.

Credit to @andylizf for the amazing contributions!

Tested (run the relevant ones):