Open chenqianfzh opened 3 weeks ago
Hi,I am pleased to see that vLLM has started to support Qlora. May I ask if the current version 0.5.0 only supports the llama-1 model or does it support the entire llama series of models?
such as this: https://huggingface.co/unsloth/mistral-7b-bnb-4bit/tree/main should be supported?
🚀 The feature, motivation and pitch
Support models like GptNeo
Alternatives
No response
Additional context
No response