SunzeY / AlphaCLIP

[CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
https://aleafy.github.io/alpha-clip
Apache License 2.0

Captions in GRIT #41

Open jiaosiyu1999 opened 4 months ago

jiaosiyu1999 commented 4 months ago

Thank you for your work.

The captions in the GRIT dataset seem to consist solely of nouns such as berries, person ... Did you use templates to expand the captions, such as "a photo of a xxx"?

SunzeY commented 4 months ago

I believe GRIT has referring expressions in the context, not solely nouns, as in the example below. We use the original referring expressions without any template.

000014732_exp
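For readers comparing the two options discussed here, the following is a minimal sketch of the difference between expanding a bare noun with a template and passing a referring expression through unchanged. It is not taken from the Alpha-CLIP codebase: the function names and the sample expression are hypothetical, and only the "a photo of a xxx" template comes from the question above.

```python
# Hypothetical illustration only; not the Alpha-CLIP data pipeline.
TEMPLATE = "a photo of a {}"  # template quoted in the question above


def caption_with_template(noun: str) -> str:
    """Expand a bare noun into a templated caption."""
    return TEMPLATE.format(noun)


def caption_from_expression(expression: str) -> str:
    """Use a referring expression as-is, only trimming whitespace."""
    return expression.strip()


if __name__ == "__main__":
    # Template route: the caption is built around a single noun.
    print(caption_with_template("berries"))
    # Original-expression route: the text is kept unchanged.
    # The expression below is a made-up example, not a GRIT sample.
    print(caption_from_expression(" a person picking berries "))
```

According to the reply above, the second route (no template) is the one used with GRIT.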