neuralmagic / guidellm

Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs
Apache License 2.0
159 stars 11 forks source link

✨ vLLM Backend integration #42

Open parfeniukink opened 2 months ago

parfeniukink commented 2 months ago

Summary

This PR extends the PR: Deepsparse Backend implementation. The base branch is parfeniukink/features/deepsparse-backend.

Usage

This is an example of a command you can use in your terminal: