hiyouga / LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
https://arxiv.org/abs/2403.13372
Apache License 2.0
31.17k stars 3.84k forks source link

利用 vLLM 部署 OpenAI API #3827

Closed xiao-liya closed 4 months ago

xiao-liya commented 4 months ago

Reminder

Reproduction

运行参数命令如下: CUDA_VISIBLE_DEVICES=0 API_PORT=8000 python ./src/api_demo.py \ --model_name_or_path saves/Custom/lora/train_QWen_book_pt1_format_sft1/export \ --template qwen \ --finetuning_type lora

打开网页后显示如下:

postman { "detail": "Not Found" }

第一次进行接口调试,请问我应该修改哪些文件,感谢

Expected behavior

No response

System Info

No response

Others

No response

hiyouga commented 4 months ago

xxx:8000/docs

https://zhuanlan.zhihu.com/p/695287607