Open sky-2002 opened 2 weeks ago
LoRA adapters are not supported yet for multi-modal models. See #7199.
Hey @DarkLight1337, if not the whole model, only this part and the inference part can be done with vLLM, right?
Currently yes, you are welcome to extend vLLM with new embedding models though!
Your current environment
How would you like to use vllm
I want to run inference with ColPali, but I don't know how to integrate it with vLLM. It uses PaliGemma, which is already in vLLM, but it also loads some adapters. Please let me know whether it can be used right away or whether any changes need to be made. I am happy to contribute.