issues
search
vectorch-ai
/
ScaleLLM
A high-performance inference system for large language models, designed for production environments.
https://docs.vectorch.com/
Apache License 2.0
316
stars
23
forks
source link
fix: decode ending tokens one by one to handle unfinished tokens
#229
Closed
guocuimi
closed
4 weeks ago