This pull request introduces the GuideLLM CLI guide, README enhancements, image uploads, and the supported backends documentation to highlight all the backends that can be used with GuideLLM.
Test Cases:
The GuideLLM CLI has been tested with various LLM models and backends.
Unit tests ensure core functionalities work as expected.
Documentation:
Created documentation detailing the GuideLLM CLI usage and output metrics.
Created documentation detailing the openai-compatible API/HTTP pathway for TGI, llama.cpp, and DeepSparse in supported_backends.md
Additional Information:
The pull request includes changes to the docs/guides directory for the CLI documentation.
Binary files containing performance summary visualizations are added to the docs/assets directory.
Summary:
This pull request introduces the GuideLLM CLI guide, README enhancements, image uploads, and the supported backends documentation to highlight all the backends that can be used with GuideLLM.
Test Cases: The GuideLLM CLI has been tested with various LLM models and backends. Unit tests ensure core functionalities work as expected.
Documentation: Created documentation detailing the GuideLLM CLI usage and output metrics. Created documentation detailing the openai-compatible API/HTTP pathway for TGI, llama.cpp, and DeepSparse in supported_backends.md
Additional Information: The pull request includes changes to the docs/guides directory for the CLI documentation. Binary files containing performance summary visualizations are added to the docs/assets directory.
Please review and provide feedback.