SG-Lang Runtime Stuck Launching in Docker Container

sgl-project / sglang

SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.

Apache License 2.0

2.75k stars 177 forks source link

We're trying to run the latest version of sg-lang in a Docker Container (PyTorch 2.3.0, CUDA 12.1) -- but the runtime instantiation gets stuck. It's start loading the model onto the GPU and then hangs.

We've been able to run sg-lang without any problems on the host operating system. So we pip froze the requirements on the host instance and installed these exact packages within the Docker Container -- but we're still hitting this model loading hang.

Has anyone seen this issue before? Any ideas what might be going wrong?

sgl-project / sglang

SG-Lang Runtime Stuck Launching in Docker Container #527