neuralmagic / guidellm

Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs
Apache License 2.0
128 stars 7 forks source link

Docs Updates #61

Open rgreenberg1 opened 6 days ago

rgreenberg1 commented 6 days ago

Summary:

This pull request introduces the GuideLLM CLI guide, README enhancements, image uploads, and the supported backends documentation to highlight all the backends that can be used with GuideLLM.

Test Cases: The GuideLLM CLI has been tested with various LLM models and backends. Unit tests ensure core functionalities work as expected.

Documentation: Created documentation detailing the GuideLLM CLI usage and output metrics. Created documentation detailing the openai-compatible API/HTTP pathway for TGI, llama.cpp, and DeepSparse in supported_backends.md

Additional Information: The pull request includes changes to the docs/guides directory for the CLI documentation. Binary files containing performance summary visualizations are added to the docs/assets directory.

Please review and provide feedback.