sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.
Apache License 2.0

Is there a plan to support the SigLIP model for the new llava-next-video architecture? #639

Closed: LetheRiver0 closed this issue 1 month ago

LetheRiver0 commented 1 month ago

Hi, I noticed that llava-next has published a new version of the llava-next-video model with llava-qwen and a SigLIP vision tower. Is there a plan to support SigLIP in sglang? Thanks~

Ying1123 commented 1 month ago

cc @ZhangYuanhan-AI @Luodian

Luodian commented 1 month ago

Yes, we already have this implementation and will open a PR for it later. 👨‍💻

LetheRiver0 commented 1 month ago

That will be amazing work! Looking forward to it~