[Model Request] Qwen-VL

jhc13 / taggui

Tag manager and captioner for image datasets

GNU General Public License v3.0

613 stars 27 forks

[Model Request] Qwen-VL #65

Open Vigilence opened 5 months ago

Vigilence commented 5 months ago

I would love to see support for Qwen-VL added! The weights for this version are available, see below.

DarkAlchy commented 5 months ago

This would be a great addition.

jhc13 commented 5 months ago

Is there a specific reason you want this model added? It seems to offer no advantages over newer models like InternLM-XComposer2.

DarkAlchy commented 5 months ago

In my usage, Qwen seems to be a bit better with realistic images than with anime images, much as Cog is compared with Intern.

Vigilence commented 5 months ago

I found Qwen to be better at realistic images than the other models, but lacking in identifying certain art. This seems to be common for most models: they excel at certain types of images and fail at others. No model at the moment excels at everything.

jhc13 commented 5 months ago

Did you try Qwen-VL and not Qwen-VL-Plus or Qwen-VL-Max? Qwen-VL is an old model and should be pretty weak.

Vigilence commented 5 months ago

It was Qwen-VL that I was testing. I spend a few hours with each model before I suggest it here. If it sucks, then I don’t recommend it. If it’s promising for most images, some art styles, then I make a request.

jhc13 commented 5 months ago

I see. I will take a look at the model when I have time.

Vigilence commented 5 months ago

@jhc13 Please share a https://ko-fi.com/ link on the project page.

jhc13 commented 5 months ago

I have added a link.

Vigilence commented 5 months ago

Sent a gift, thank you!

jhc13 commented 5 months ago

Thank you!