Open alita-moore opened 2 weeks ago
This randomly started working 🤷‍♀️
Maybe it was spending time to initialize / pull packages or checkpoints?
I'm not sure. I didn't see the logs moving at all, and the init script had already finished. I'll let you know if it happens again.
It might be an availability issue with RunPod, i.e. SkyServe kept trying to get resources from RunPod but failed because none were available. You can find some useful logs with `sky serve logs sky-service-0fc2 3`, which will show the logs for that specific replica.
I am trying to run a Docker container on RunPod that serves a FastAPI app on port 8000. When I provision the resources, the system gets stuck at status "STARTING". Here are the details:
with the following log outputs
and
Version & Commit info:
* `sky -v`: skypilot, version 0.7.0
* `sky -c`: 3f625886bf1b13ee463a9f8e0f6741f620f7f66f
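
If this comes up again, one thing that might help narrow it down is checking from inside the container whether the FastAPI server actually answers on port 8000. The sketch below is only that, a sketch: it assumes the replica is stuck because nothing is responding on that port yet, which this thread does not confirm. `is_ready` is a hypothetical helper name, and the URL path is a placeholder for whatever route the service's readiness check uses.

```python
import http.client
import urllib.error
import urllib.request


def is_ready(url: str, timeout: float = 5.0) -> bool:
    """Return True if `url` answers with a 2xx status within `timeout` seconds."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return 200 <= resp.status < 300
    except (urllib.error.URLError, http.client.HTTPException, OSError):
        # Covers connection refused, timeouts, DNS failures, malformed
        # responses, and HTTP error statuses (urlopen raises HTTPError
        # for 4xx/5xx responses).
        return False


# Example (assumption: the app serves some route at "/" on port 8000):
#   is_ready("http://localhost:8000/")
```

If this returns `False` after the init script has finished, the server process itself is probably the place to look; if it returns `True` while the status still shows "STARTING", the replica logs are likely more informative.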