Open xubzhlin opened 5 months ago
I had the same problem...
Traceback (most recent call last): File "./swift/demo_server_vllm_xyf.py", line 106, in get_all_component_res async for request_output in results_generator: File "./vllm/vllm/engine/async_llm_engine.py",line 673,in generate async for output in self._process_request( File "./vllm/vllm/engine/async_llm_engine.py", line 780, in _process_request raise e File "./vllm/vllm/engine/asyncIlm_engine.py", line 776, in _process_request async for request output in stream: File "./vllm/vllm/engine/async_llm_engine.py", line 89, in _anext raise result File "./vllm/vllm/vllm/enggine/async_llm_engine.py", line 42, in _log_task_completiom return_value = task.result() File "./vllm/vllm/engine/async_limengine.py", line 532, in run_engine_loop has_requests_in_progress = await asyncio.wait_for( File "/opt/conda/envs/infer/lib/python3.10/asyncio/tasks.py", line 445in wait_for return fut.result() File "./vllm/vllm/vllm/engine/async_lngine.py", line 510, in engine_step self._request_tracker.process_request_output( File "./vllm/vllm/engine/async_llm_engine.py", line 130, in process_request_output self._request_streams[request_id].put(request_output) KeyError: 'cc2580f508eb473285a9e1bb47a6714f
Your current environment
🐛 Describe the bug
Run Stack
When I call multiple times
generator
in flask。This bug will appear。Later the error request_id is the same