Feature Request: Support for Qwen2-VL - Githubissues

ggerganov / llama.cpp

LLM inference in C/C++

MIT License

64.91k stars 9.31k forks source link

Feature Request: Support for Qwen2-VL #9246

Open isr431 opened 2 weeks ago

isr431 commented 2 weeks ago

Prerequisites

[X] I am running the latest code. Mention the version if possible as well.
[X] I carefully followed the README.md.
[X] I searched using keywords relevant to my issue to make sure that I am creating a new issue that is not already open (or closed).
[X] I reviewed the Discussions, and have a new and useful enhancement to share.

Feature Description

Qwen just released Qwen2-VL 2B & 7B under the Apache 2.0 License.

Motivation

SoTA understanding of images of various resolution & ratio: Qwen2-VL achieves state-of-the-art performance on visual understanding benchmarks, including MathVista, DocVQA, RealWorldQA, MTVQA, etc. Understanding videos of 20min+: Qwen2-VL can understand videos over 20 minutes for high-quality video-based question answering, dialog, content creation, etc.

Possible Implementation

No response

chigkim commented 2 weeks ago

+1 This would be another great addition!

crzroot commented 2 weeks ago

This model is awesome

suepradun commented 2 weeks ago

I am looking forward to it very much

xzlinux commented 2 weeks ago

+1 I am looking forward to it very much

yukiarimo commented 1 week ago

We can try Llamafing it

XDesktopSoft commented 1 week ago

+1

WildCatApp commented 1 week ago

+1

uestcbraid commented 1 week ago

+1

mrhalyang commented 1 week ago

+1

elyzionz commented 1 week ago

+1

2OsZI4ISYd commented 1 week ago

+1

Kimizhao commented 1 week ago

+1

enryteam commented 1 week ago

+1

yukiarimo commented 1 week ago

Any updates?

apipino commented 1 week ago

+1

Xhehab commented 1 week ago

+1

Seaman3body commented 1 week ago

+1

zenoverflow commented 1 week ago

+1

whoisltd commented 1 week ago

+1

eav-solution commented 6 days ago

+1

feynmanloo commented 5 days ago

I can not wait for it !!!

chigkim commented 5 days ago

Maybe people should also express interest and ask Qwen2-VL devs to implement. https://github.com/QwenLM/Qwen2-VL/issues/7

wmx-github commented 3 days ago

Expect to use llama.cpp end side inference

HimariO commented 3 days ago

Is anyone already working on this? If not, I would like to give it a try.

solangii commented 3 days ago

+1 is there any updates?

PredyDaddy commented 2 days ago

+1

shobhit9618 commented 2 days ago

+1

zhouxihong1 commented 1 day ago

+1