xusenlinzy / api-for-open-llm

Openai style api for open large language models, using LLMs just as chatgpt! Support for LLaMA, LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, Xverse, SqlCoder, CodeLLaMA, ChatGLM, ChatGLM2, ChatGLM3 etc. 开源大模型的统一后端接口
Apache License 2.0
2.36k stars 270 forks source link

change the parameter best_of of vllm chat_completion #289

Closed Tendo33 closed 5 months ago

Tendo33 commented 5 months ago

当使用 vllm 进行部署,并用 langchain 进行推理时。langchain的请求默认 best_of 为 1,如果请求中的 n > 1,vllm 会报以下错误。

1718332962630_309F19A2-D9D8-47c0-8EC4-67B53C842B84
xusenlinzy commented 5 months ago

已添加