Closed ChunyiY closed 2 weeks ago
lmdeploy 服务端的日志是怎样的呢 ?
I have encountered similar problems, feeling that the health_check block is the default return 200, sometimes the request is too high service crash will not automatically restart
This issue is marked as stale because it has been marked as invalid or awaiting response for 7 days without any further response. It will be closed in 5 days if the stale label is not removed or if there is no further response.
This issue is closed because it has been stale for 5 days. Please open a new issue if you have similar issues or you have any new updates now.
Checklist
Describe the bug
I ran this code and did a load test based on my code, testing qwen2-72b. I am on a remote server with 4 A100 gpus to deploy this. I used locust as the platform to run my load test, I tried multiple user counts such as 100, 50 ,10, but they all hanged up after completing 900 requests.
After 900 requests, the model stopped processing new requests. 服务器在处理了900个请求之后挂住,不知道应该如何解决??
Reproduction
Environment
Error traceback
No response