dstackai / dstack

dstack is an open-source alternative to Kubernetes, designed to simplify development, training, and deployment of AI across any cloud or on-prem. It supports NVIDIA, AMD, and TPU.
https://dstack.ai/docs
Mozilla Public License 2.0
1.42k stars 150 forks source link

[Feature]: Allow running services and models without a gateway #1595

Open peterschmidt85 opened 2 months ago

peterschmidt85 commented 2 months ago

Use case:

Suggestion:

Motivation:

peterschmidt85 commented 2 months ago

Another reason why this issue is important is that in our documentation, we often use tasks for deploying models because it doesn’t require creating a gateway, which is also not supported for all backends. However, when a model is running as a task, the user cannot use the UI to interact with the model.

jvstme commented 1 month ago

Implementation progress

Usage instructions for the current prototype

  1. Run dstack server with the DSTACK_FF_PROXY=1 environment variable.
  2. Make sure your project doesn't have a gateway or neither of the gateways is default.
  3. Run a service.
  4. Your service is now available at <dstack-server-base-url>/proxy/services/<project-name>/<run-name>/
peterschmidt85 commented 1 month ago

Plus, ensure the UI can leverage the built-in proxy