SunzeY / AlphaCLIP

[CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
https://aleafy.github.io/alpha-clip
Apache License 2.0
703 stars 43 forks source link

Table 6: Performance of Alpha-CLIP in region level captioning #34

Open jetyingjia opened 8 months ago

jetyingjia commented 8 months ago

Great work! I am confused with Tab .6 result, the performance is Alpha-CLIP with LLaVA-1.5 or fine-tune this model with vicuna-7b on these datasets(RefCOCOg or VG)?

SunzeY commented 8 months ago

Hi, This have been discussed in #24.