Closed pseudotensor closed 4 months ago
https://huggingface.co/microsoft/Phi-3-vision-128k-instruct
No response
vllm is somewhat behind in vision support. idefics2 is supported by TGI and lllava next been out for months and not supported yet. There is a PR, is it close?
The vllm's multi-modality support is still under refactoring:
So we need waiting some necessary refactoring work (like ImageProcessor support) finished before we add new vision model.
🚀 The feature, motivation and pitch
https://huggingface.co/microsoft/Phi-3-vision-128k-instruct
Alternatives
No response
Additional context
vllm is somewhat behind in vision support. idefics2 is supported by TGI and lllava next been out for months and not supported yet. There is a PR, is it close?