What's the data source of one-category multi-object QA pairs?

jefferyZhan / Griffon

Official repo of Griffon series including v1(ECCV 2024), v2, and G

Apache License 2.0


Hi! Thanks for sharing your excellent work. I have a question about the one-category multi-object QA data used in the pre-training of Griffon.

I understand that one-category one-object and multi-category multi-object data can be converted from vanilla REC or detection data, but how are the one-category multi-object QA pairs obtained? Do they come only from Flickr30k, or are they also converted from detection data by querying a single label per image? I'm also curious about the statistics for each of these data forms in the dataset (for example, how many QA pairs of each form there are).
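To make the second option concrete, this is roughly the conversion I have in mind. It is only a sketch of my guess: the COCO-style annotation layout, the function name `one_category_multi_object_pairs`, the `min_objects` threshold, and the question template are all my own assumptions, not anything taken from the Griffon code or paper.

```python
from collections import defaultdict


def one_category_multi_object_pairs(annotations, categories, min_objects=2):
    """Build one QA pair per (image, category) group that has several boxes.

    `annotations` is assumed to be a list of COCO-style dicts with
    `image_id`, `category_id`, and `bbox` keys (hypothetical input layout).
    """
    # Collect every box that shares the same image and the same label.
    groups = defaultdict(list)
    for ann in annotations:
        groups[(ann["image_id"], ann["category_id"])].append(ann["bbox"])

    pairs = []
    for (image_id, category_id), boxes in sorted(groups.items()):
        # Keep only groups where the single queried label has multiple objects.
        if len(boxes) < min_objects:
            continue
        name = categories[category_id]
        pairs.append({
            "image_id": image_id,
            # Hypothetical prompt template, not the one used by the authors.
            "question": f"Locate all instances of {name} in the image.",
            "answer": boxes,
        })
    return pairs


# Toy detection annotations: image 1 has two dogs and one person.
annotations = [
    {"image_id": 1, "category_id": 18, "bbox": [10, 20, 50, 60]},
    {"image_id": 1, "category_id": 18, "bbox": [100, 40, 30, 30]},
    {"image_id": 1, "category_id": 1, "bbox": [5, 5, 80, 200]},
    {"image_id": 2, "category_id": 18, "bbox": [0, 0, 40, 40]},
]
categories = {1: "person", 18: "dog"}
pairs = one_category_multi_object_pairs(annotations, categories)
```

With this toy input, only the two dogs in image 1 would form a one-category multi-object pair; the single person and the single dog in image 2 would fall under the one-category one-object form instead. Is this close to how the data was actually built?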

Looking forward to your reply and best wishes!

What's the data source of one-category multi-object QA pairs? #1