Closed jeremyfowers closed 2 months ago
This update adds the turnkey-llm CLI and turnkeyml.llm APIs, which provide LLM-specific tools for prompting, serving, and accuracy tests.
turnkey-llm
turnkeyml.llm
This initial release has device/runtime support for:
The tools provided are:
huggingface-load
oga-load
ryzenai-npu-load
llm-prompt
serve
accuracy-mmlu
accuracy-perplexity
Learn more about the features and usage in the README.md at src/turnkeyml/llm/README.md.
This update adds the
turnkey-llm
CLI andturnkeyml.llm
APIs, which provide LLM-specific tools for prompting, serving, and accuracy tests.This initial release has device/runtime support for:
The tools provided are:
huggingface-load
: load LLM from huggingface checkpointoga-load
: load LLM using OnnxRuntime-GenAI onto DirectML deviceryzenai-npu-load
: load LLM onto RyzenAI NPU devicellm-prompt
: chat with your LLMserve
: launch a websocket server for your LLMaccuracy-mmlu
andaccuracy-perplexity
: measure the performance of your LLM on the target backendLearn more about the features and usage in the README.md at src/turnkeyml/llm/README.md.