DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including x86 and ARMv9.
Apache License 2.0
135
stars
15
forks
source link
使用千问2.0-7B加载千问2.5-3B模型提示"Only allowed now, your model Qwen2-7B" #42
OPENAI 报错
API Error: Status Code 400, {"object":"error","message":"Only allowed now, your model Qwen2-7B","code":40301}
docker部署。
启动日志: