jina-ai / rungpt

An open-source, cloud-native serving framework for large multi-modal models (LMMs).
Apache License 2.0
153 stars 21 forks

Support InstructBLIP model #20

Open numb3r3 opened 1 year ago

numb3r3 commented 1 year ago

As the title says, LAVIS just released InstructBLIP, a new vision-language instruction-tuning framework built on BLIP-2 models, which achieves state-of-the-art zero-shot generalization performance on a wide range of vision-language tasks. https://github.com/salesforce/LAVIS/tree/main/projects/instructblip

nomagick commented 1 year ago

Replicate now has a runnable version: https://replicate.com/joehoover/instructblip-vicuna13b