skypilot-org / skypilot

SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
https://skypilot.readthedocs.io
Apache License 2.0
6.82k stars 513 forks source link

Set minimum port number a Ray worker can listen on to 11002 #4278

Closed cbrownstein-lambda closed 2 weeks ago

cbrownstein-lambda commented 2 weeks ago

This pull request sets the minimum port number a Ray worker can listen on to 11002. Without this change, when running examples/k8s_cloud_deploy/launch_k8s.sh, k3s/containerd will fail with the error:

/var/lib/rancher/k3s/agent/containerd/containerd.log
time="2024-11-06T18:12:48.119844117Z" level=fatal msg="Failed to run CRI service" error="stream server error: listen tcp 127.0.0.1:10010: bind: address already in use"

See: