awslabs / data-on-eks

DoEKS is a tool to build, deploy and scale Data & ML Platforms on Amazon EKS
https://awslabs.github.io/data-on-eks/
Apache License 2.0
617 stars 210 forks source link

docs: Website documentation for vllm inferencing using rayserve on AWS Inferentia #637

Closed sindhupalakodety closed 1 week ago

sindhupalakodety commented 3 weeks ago

…n AWS Inferentia

What does this PR do?

🛑 Please open an issue first to discuss any significant work and flesh out details/direction - we would hate for your time to be wasted. Consult the CONTRIBUTING guide for submitting pull-requests.

Created a website documentation for vllm inferencing using rayserve on AWS Inferentia

Motivation

To have users use vllm in association with RayServe and Trainium and to use llmperf tool for benchmarking

More

For Moderators

Additional Notes