rank0: Traceback (most recent call last):
rank0: File "/home/arsal/anaconda3/envs/vllm/lib/python3.10/runpy.py", line 196, in _run_module_as_main
rank0: return _run_code(code, main_globals, None,
rank0: File "/home/arsal/anaconda3/envs/vllm/lib/python3.10/runpy.py", line 86, in _run_code
rank0: exec(code, run_globals)
rank0: File "/home/arsal/anaconda3/envs/vllm/lib/python3.10/site-packages/vllm/entrypoints/openai/run_batch.py", line 146, in
rank0: File "/home/arsal/anaconda3/envs/vllm/lib/python3.10/asyncio/runners.py", line 44, in run
rank0: return loop.run_until_complete(main)
rank0: File "/home/arsal/anaconda3/envs/vllm/lib/python3.10/asyncio/base_events.py", line 649, in run_until_complete
rank0: return future.result()
rank0: File "/home/arsal/anaconda3/envs/vllm/lib/python3.10/site-packages/vllm/entrypoints/openai/run_batch.py", line 130, in main
rank0: responses = await asyncio.gather(*response_futures)
rank0: File "/home/arsal/anaconda3/envs/vllm/lib/python3.10/site-packages/vllm/entrypoints/openai/run_batch.py", line 93, in run_request
rank0: File "/home/arsal/anaconda3/envs/vllm/lib/python3.10/site-packages/pydantic/main.py", line 193, in initrank0: self.__pydantic_validator__.validate_python(data, self_instance=self)
rank0: pydantic_core._pydantic_core.ValidationError: 1 validation error for BatchResponseData
Your current environment
🐛 Describe the bug
Running this command to do batch inference through API, returns the following error. The input.jsonl is as per required format.
python -m vllm.entrypoints.openai.run_batch -i input.jsonl -o results.jsonl --model Granther/Gemma-2-9B-Instruct-4Bit-GPTQ --max_model_len 3000
Error Traceback: