Closed ccdv-ai closed 2 months ago
facing the same issue
The message about prompt logprobs not being supported is just a warning. The crash cause here is the KeyError:
File "/opt/anaconda/envs/transformers/lib/python3.9/site-packages/vllm/worker/model_runner.py", line 1233, in execute_model
model_executable = self.graph_runners[virtual_engine][
KeyError: 48
So I believe this issue is the same as https://github.com/vllm-project/vllm/issues/7907
should be solved by https://github.com/vllm-project/vllm/pull/7894
Your current environment
🐛 Describe the bug
Trying to use ngram speculative decoding (v0.5.1) but fails with:
WARNING 07-10 13:24:34 multi_step.py:57] Prompt logprob is not supported by multi step workers. (e.g., speculative decode uses multi step workers)
Full error: