DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including x86 and ARMv9.
Apache License 2.0
130 stars
14 forks
fix: update README since we support 32k context length #12
Do as the title says.