Closed PeiMing1998 closed 11 months ago
Impressive work! But I am a confused why in Table2 llava and shikera are deemed as no zero-shot ability models? Thank you.
thanks, we consider them as non-zero-shot because they were trained with referring or grounding annotations, e.g., refcoco, flickr30, visual genome, etc.
Impressive work! But I am a confused why in Table2 llava and shikera are deemed as no zero-shot ability models? Thank you.