issues
search
vectorch-ai
/
ScaleLLM
A high-performance inference system for large language models, designed for production environments.
https://docs.vectorch.com/
Apache License 2.0
316
stars
23
forks
source link
[wip] feat: added logprobs support for speculative decoding
#235
Closed
guocuimi
closed
3 weeks ago