jefferyZhan / Griffon

Official repo of Griffon series including v1 (ECCV 2024), v2, and G
Apache License 2.0
112 stars, 6 forks

What's the data source of one-category multi-object QA pairs? #1

Closed likanchuan closed 11 months ago

likanchuan commented 12 months ago

Hi! Thanks for sharing your excellent work. I have a question about the one-category multi-object QA data used in the pre-training of Griffon.

I understand that one-category one-object and multi-category multi-object data can be converted from vanilla REC or detection data, but how are the one-category multi-object QA pairs obtained? Do they come only from Flickr30K, or are they converted from detection data by querying a single label? I'm also curious about how many samples of each form the dataset contains.

Looking forward to your reply and best wishes!

jefferyZhan commented 12 months ago

Hi Kanchuan,

Thanks for your attention to our work. In this stage, the One Category with Multi Objects pairs are generated from phrase grounding datasets, including Flickr30K and GRIT, which we'll release later. We did not convert the detection data by querying a single label, as we already include the whole detection data as the Multi Categories with Multi Objects type. As for the statistics of the different scenario data, we may present them on the dataset page when we make the dataset public on schedule.
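
For readers who want a concrete picture of the conversion described above, here is a minimal sketch in Python. It is not the Griffon pipeline: it assumes phrase grounding annotations arrive as (phrase, box) pairs for one image, where a single phrase may be linked to several boxes. The function name `grounding_to_qa`, the question template, and the answer format are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def grounding_to_qa(phrase_boxes):
    """Build one-category multi-object QA pairs for a single image.

    phrase_boxes: iterable of (phrase, (x1, y1, x2, y2)) pairs, as might be
    read from a phrase grounding dataset (assumed input layout).
    Only phrases linked to two or more distinct boxes are kept.
    """
    grouped = defaultdict(list)
    for phrase, box in phrase_boxes:
        # Normalize the phrase so "People" and "people " group together.
        grouped[phrase.strip().lower()].append(tuple(box))

    qa_pairs = []
    for phrase, boxes in grouped.items():
        boxes = sorted(set(boxes))  # drop duplicate boxes, fix the order
        if len(boxes) < 2:
            continue  # a single box is the one-category one-object case
        # Hypothetical prompt and answer templates, not the ones Griffon uses.
        question = f"Locate all instances of {phrase} in the image."
        answer = "; ".join(
            f"{phrase}-[{x1}, {y1}, {x2}, {y2}]" for x1, y1, x2, y2 in boxes
        )
        qa_pairs.append({"question": question, "answer": answer, "boxes": boxes})
    return qa_pairs
```

Calling `grounding_to_qa` on an image whose annotations link the phrase "people" to two boxes and "a dog" to one box would yield a single QA pair for "people"; the "a dog" annotation would fall under the one-category one-object form instead.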