scaleapi / llm-engine

Scale LLM Engine public repository
https://llm-engine.scale.com
Apache License 2.0
779 stars 54 forks source link

Comparison benchmarks? #138

Open tripathiarpan20 opened 1 year ago

tripathiarpan20 commented 1 year ago

Hi, Thanks for open-sourcing the code.

I was wondering how it compares in terms of throughput with existing inference frameworks like https://github.com/huggingface/text-generation-inference and https://github.com/vllm-project/vllm , do we have any benchmarks?

rkaplan commented 1 year ago

Thanks for the request — we will be sure to add some benchmarks. cc @yixu34

Under the hood, the inference serving component is handled by HF Text Generation Inference, so the inference throughput should be similar or equivalent to that library.