OpenGVLab / InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance
https://internvl.readthedocs.io/en/latest/
MIT License
5.71k stars, 445 forks

Zeroshot classification results for InternVL-G #30

Closed BIGBALLON closed 9 months ago

BIGBALLON commented 9 months ago

Hi, are there any zero-shot classification (ImageNet) evaluation results for InternVL-G?

czczup commented 9 months ago

Hello, I can list the results here if you want, but the classification performance of InternVL-G is not improved compared to InternVL-C. The main improvement of InternVL-G is retrieval capability.

BIGBALLON commented 9 months ago

Yes, I'm interested in how it performs on downstream zero-shot classification tasks.

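| Model name | IN-1K | IN-A | IN-R | IN-V2 | IN-Sketch | ObjectNet | Δ | Average |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| InternVL-C | 83.2 | 83.8 | 95.5 | 77.3 | 73.9 | 80.6 | 0.8 | 82.4 |
| InternVL-G | - | - | - | - | - | - | - | - |

czczup commented 9 months ago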

@BIGBALLON This is the result.

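| Model name | IN-1K | IN-A | IN-R | IN-V2 | IN-Sketch | ObjectNet | Δ | Average |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| InternVL-C | 83.2 | 83.8 | 95.5 | 77.3 | 73.9 | 80.6 | 0.8 | 82.4 |
| InternVL-G | 82.9 | 82.4 | 95.5 | 77.1 | 73.3 | 79.3 | 1.1 | 81.8 |
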
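
As a quick sanity check on the numbers above, the sketch below recomputes the average over the six datasets and the difference between the IN-1K score and that average. It assumes the second-to-last value in each row (0.8 and 1.1) is that difference; the thread itself does not say so. The names `RESULTS`, `REPORTED`, and `summarize` are illustrative and do not come from the InternVL codebase.

```python
# Per-dataset top-1 accuracies (%) copied from the table in this thread.
DATASETS = ["IN-1K", "IN-A", "IN-R", "IN-V2", "IN-Sketch", "ObjectNet"]
RESULTS = {
    "InternVL-C": [83.2, 83.8, 95.5, 77.3, 73.9, 80.6],
    "InternVL-G": [82.9, 82.4, 95.5, 77.1, 73.3, 79.3],
}
# (gap, average) as listed in the last two values of each table row.
REPORTED = {
    "InternVL-C": (0.8, 82.4),
    "InternVL-G": (1.1, 81.8),
}


def summarize(scores):
    """Return (mean over all datasets, IN-1K score minus that mean)."""
    mean = sum(scores) / len(scores)
    # Assumption: the unlabeled column is the gap between IN-1K and the mean.
    gap = scores[0] - mean
    return mean, gap


for name, scores in RESULTS.items():
    mean, gap = summarize(scores)
    rep_gap, rep_mean = REPORTED[name]
    print(
        f"{name}: mean={mean:.2f} (reported {rep_mean}), "
        f"gap={gap:.2f} (reported {rep_gap})"
    )
```

Under that assumption, the recomputed values agree with the reported ones to within rounding at one decimal place, which is consistent with the statement that InternVL-G does not improve on InternVL-C for zero-shot classification.
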
BIGBALLON commented 9 months ago

@czczup thanks !!!

BIGBALLON commented 7 months ago

> I can list the results here if you want, but the classification performance of InternVL-G is not improved compared to InternVL-C. The main improvement of InternVL-G is retrieval capability.

Hi, @czczup, can you explain why InternVL-G improved retrieval capability?