Open alexhegit opened 5 months ago
Ollama implements an OpenAI compatible API so you might be able to benchmark against that using the OpenAI benchmark instructions from the llmperf README. For the OPENAI_API_KEY
you can pass in anything, but I think it's required.
Hello,
How to use llmperf to test ollama performance (TTFT, etc)?
Thanks, Alex