kserve / website

User documentation for KServe.
https://kserve.github.io/website/
Apache License 2.0

Point users to vLLM production server #362

Closed dulacp closed 1 month ago

dulacp commented 1 month ago

The vLLM team states that `vllm.entrypoints.api_server` only demonstrates usage of their AsyncEngine; for production use, they point users to `vllm.entrypoints.openai.api_server` instead.
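For context, the production entrypoint is typically started with `python -m vllm.entrypoints.openai.api_server --model <model>` and serves OpenAI-style routes such as `/v1/completions`. The snippet below is a minimal sketch of a client request against it, not part of this PR's change. The helper name `build_completion_request`, the `localhost:8000` address, and the model name are illustrative assumptions.

```python
import json
import urllib.request


def build_completion_request(
    base_url: str, model: str, prompt: str, max_tokens: int = 32
) -> urllib.request.Request:
    """Build (but do not send) a POST for the OpenAI-style /v1/completions route.

    Hypothetical helper for illustration; base_url and model are assumptions
    that must match however the server was actually launched.
    """
    payload = {"model": model, "prompt": prompt, "max_tokens": max_tokens}
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/v1/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


if __name__ == "__main__":
    # Assumed address and model name; adjust to the deployed server.
    req = build_completion_request(
        "http://localhost:8000", "facebook/opt-125m", "KServe is"
    )
    print(req.get_method(), req.full_url)
    # Sending the request needs a running server, so it is left commented out:
    # with urllib.request.urlopen(req) as resp:
    #     print(json.load(resp))
```

Building the request separately from sending it keeps the example runnable without a live server.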

Proposed Changes

netlify[bot] commented 1 month ago

Deploy Preview for elastic-nobel-0aef7a ready!

| Name | Link |
| --- | --- |
| Latest commit | 742e5dfe85348adb562b598e3909cfe7272580eb |
| Latest deploy log | https://app.netlify.com/sites/elastic-nobel-0aef7a/deploys/663dde044bb6b0000881de61 |
| Deploy Preview | https://deploy-preview-362--elastic-nobel-0aef7a.netlify.app |


yuzisun commented 1 month ago

/lgtm /approve

oss-prow-bot[bot] commented 1 month ago

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: dulacp, yuzisun

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

- ~~[OWNERS](https://github.com/kserve/website/blob/main/OWNERS)~~ [yuzisun]

Approvers can indicate their approval by writing `/approve` in a comment.
Approvers can cancel approval by writing `/approve cancel` in a comment.