利用 vLLM 部署 OpenAI API

Reminder

[X] I have read the README and searched the existing issues.

Reproduction

运行参数命令如下： CUDA_VISIBLE_DEVICES=0 API_PORT=8000 python ./src/api_demo.py \ --model_name_or_path saves/Custom/lora/train_QWen_book_pt1_format_sft1/export \ --template qwen \ --finetuning_type lora

打开网页后显示如下：

postman { "detail": "Not Found" }

第一次进行接口调试，请问我应该修改哪些文件，感谢

Expected behavior

No response

System Info

No response

Others

No response

hiyouga / LLaMA-Factory

利用 vLLM 部署 OpenAI API #3827

Reminder

Reproduction

Expected behavior

System Info

Others