Open santurini opened 4 months ago
This is my first time ever using vLLM, and as it happens I wanted to use it with Phi-3; sure enough, same issue. Here is my repro too:
Python: 3.10, CUDA: 12.2
from vllm import SamplingParams, LLM

prompts = [
    "Hello, my name is",
    "The president of the United States is",
    "The capital of France is",
    "The future of AI is",
]
sampling_params = SamplingParams(temperature=0.8, top_p=0.95)
llm = LLM(model="microsoft/Phi-3-mini-128k-instruct")
outputs = llm.generate(prompts, sampling_params)

# Print the outputs.
for output in outputs:
    prompt = output.prompt
    generated_text = output.outputs[0].text
    print(f"Prompt: {prompt!r}, Generated text: {generated_text!r}")
Traceback (most recent call last):
  File "/home/nevercast/vllm/vllm_run.py", line 11, in <module>
    llm = LLM(model="microsoft/Phi-3-mini-128k-instruct")
  File "/home/nevercast/vllm/.venv/lib/python3.10/site-packages/vllm/entrypoints/llm.py", line 144, in __init__
    self.llm_engine = LLMEngine.from_engine_args(
  File "/home/nevercast/vllm/.venv/lib/python3.10/site-packages/vllm/engine/llm_engine.py", line 336, in from_engine_args
    engine_config = engine_args.create_engine_config()
  File "/home/nevercast/vllm/.venv/lib/python3.10/site-packages/vllm/engine/arg_utils.py", line 630, in create_engine_config
    model_config = ModelConfig(
  File "/home/nevercast/vllm/.venv/lib/python3.10/site-packages/vllm/config.py", line 137, in __init__
    self.hf_config = get_config(self.model, trust_remote_code, revision,
  File "/home/nevercast/vllm/.venv/lib/python3.10/site-packages/vllm/transformers_utils/config.py", line 48, in get_config
    raise e
  File "/home/nevercast/vllm/.venv/lib/python3.10/site-packages/vllm/transformers_utils/config.py", line 33, in get_config
    config = AutoConfig.from_pretrained(
  File "/home/nevercast/vllm/.venv/lib/python3.10/site-packages/transformers/models/auto/configuration_auto.py", line 989, in from_pretrained
    return config_class.from_dict(config_dict, **unused_kwargs)
  File "/home/nevercast/vllm/.venv/lib/python3.10/site-packages/transformers/configuration_utils.py", line 772, in from_dict
    config = cls(**config_dict)
  File "/home/nevercast/vllm/.venv/lib/python3.10/site-packages/transformers/models/phi3/configuration_phi3.py", line 158, in __init__
    self._rope_scaling_validation()
  File "/home/nevercast/vllm/.venv/lib/python3.10/site-packages/transformers/models/phi3/configuration_phi3.py", line 185, in _rope_scaling_validation
    raise ValueError(f"`rope_scaling`'s type field must be one of ['su', 'yarn'], got {rope_scaling_type}")
ValueError: `rope_scaling`'s type field must be one of ['su', 'yarn'], got longrope
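The failure happens while transformers builds the Hugging Face config, before any vLLM-specific code runs, so the check that rejects "longrope" is the one in the installed transformers copy of Phi3Config. A minimal diagnostic sketch (same model ID as above; whether it raises depends on the installed transformers version, so treat it as an assumption rather than a guaranteed repro):

# Minimal diagnostic: load only the Hugging Face config, no vLLM involved.
# On a transformers build whose Phi3Config accepts "longrope" this prints the
# rope_scaling dict; on older builds it raises the same ValueError as above.
import transformers
from transformers import AutoConfig

print("transformers version:", transformers.__version__)

config = AutoConfig.from_pretrained("microsoft/Phi-3-mini-128k-instruct")
print(config.rope_scaling)

If this standalone load fails with the same message, the problem sits on the transformers side (or in the model's config.json) rather than in vLLM itself.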
This issue has been automatically marked as stale because it has not had any activity within 90 days. It will be automatically closed if no further activity occurs within 30 days. Leave a comment if you feel this issue should remain open. Thank you!
Your current environment
🐛 Describe the bug
When trying to initialize an LLM instance with vLLM, I keep getting the same error no matter what I do. I found some recent changes specific to the Phi-3 model in commit #5595, but yesterday it was working fine. Below are the code and error:
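As a possible workaround sketch (my assumption, not something confirmed in this thread): the rejected "longrope" value appears to be a newer spelling of the "su" rope_scaling type that this transformers build does accept, so loading a local copy of the model whose config.json uses "su" may sidestep the validation. The directory name phi3-local below is illustrative.

# Workaround sketch. Assumptions: editing a local copy of config.json is
# acceptable, and "su" is treated the same as "longrope" by this transformers
# build; verify against the model card before relying on this.
import json
import os

from huggingface_hub import snapshot_download
from vllm import LLM

# Download the model into a plain local directory so the hub cache is not mutated.
local_dir = snapshot_download(
    "microsoft/Phi-3-mini-128k-instruct", local_dir="phi3-local"
)

config_path = os.path.join(local_dir, "config.json")
with open(config_path) as f:
    config = json.load(f)

# Rewrite the rope_scaling type only if it is the rejected "longrope" value.
if (config.get("rope_scaling") or {}).get("type") == "longrope":
    config["rope_scaling"]["type"] = "su"
    with open(config_path, "w") as f:
        json.dump(config, f, indent=2)

# Point vLLM at the patched local copy instead of the hub ID.
llm = LLM(model=local_dir)

Upgrading transformers (and vLLM) to releases that recognize "longrope" directly would be the cleaner fix, once those are available in your environment.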