关于跟视觉相关的shortcut统计相关问题 - Githubissues

PhoebusSi / VQA-VS

Code for our EMNLP-2022 paper: "Language Prior Is Not the Only Shortcut: A Benchmark for Shortcut Learning in VQA"

35 stars 2 forks source link

关于跟视觉相关的shortcut统计相关问题 #2

Open dggeed opened 9 months ago

dggeed commented 9 months ago

您好，感谢您的工作！我看您文章中附录B.3章节介绍关于如何计算KO/KOP/QT+KO-specific 相关的互信息那段描述中的提及了，KO: All objects (81 kinds of objects in total) appear in the images are provided in the VQA v2 annotation.，我想请教一下，您这里提及的VQAv2数据集的标注中一共有81种目标是怎么得到的呀？另外还想请教下，每条样本包含了<image, question, answer>，该样本中的image中包含了哪些object的相关信息也包含在VQAv2数据集的标注中吗？

PhoebusSi commented 9 months ago

这里81个目标在coco数据集中是给出来了的，可以找一下。是的，都在vqav2标注中～ ---- 回复的原邮件 @.>发送日期2023年10月25日 00:11 @.> @.***>主题[PhoebusSi/VQA-VS] 关于跟视觉相关的shortcut统计相关问题 (Issue #2) 您好，感谢您的工作！我看您文章中附录B.3章节介绍关于如何计算KO/KOP/QT+KO-specific 相关的互信息那段描述中的提及了，KO: All objects (81 kinds of objects in total) appear in the images are provided in the VQA v2 annotation.，我想请教一下，您这里提及的VQAv2数据集的标注中一共有81种目标是怎么得到的呀？另外还想请教下，每条样本包含了<image, question, answer>，该样本中的image中包含了哪些object的相关信息也包含在VQAv2数据集的标注中吗？

—Reply to this email directly, view it on GitHub, or unsubscribe.You are receiving this because you are subscribed to this thread.Message ID: @.> [ { @.": "http://schema.org", @.": "EmailMessage", "potentialAction": { @.": "ViewAction", "target": "https://github.com/PhoebusSi/VQA-VS/issues/2", "url": "https://github.com/PhoebusSi/VQA-VS/issues/2", "name": "View Issue" }, "description": "View this Issue on GitHub", "publisher": { @.***": "Organization", "name": "GitHub", "url": "https://github.com" } } ]

dggeed commented 9 months ago

感谢您的回复，再麻烦请教下：每张image中包含了哪些object的相关信息包含在VQAv2数据集的标注中是哪个版本的数据集呢。我下载了之前VQAv2数据集和您这篇文章中的VQA-VS数据集，question标注信息中都只包含了{question_id, image_id, question}，而answer标注信息中也只包含了{question_type, multiple_choice_answer, image_id, answer_type, question_id}信息，没有看到哪里有说每张image中包含了哪些object呀

PhoebusSi commented 9 months ago

我记得vqav2的图片来自于coco数据集，可以去coco数据集的标注找下～ ---- 回复的原邮件 @.>发送日期2023年10月25日 10:31 @.> 抄送人Qingyi @.>@.>主题Re: [PhoebusSi/VQA-VS] 关于跟视觉相关的shortcut统计相关问题 (Issue #2) 感谢您的回复，再麻烦请教下：每张image中包含了哪些object的相关信息包含在VQAv2数据集的标注中是哪个版本的数据集呢。我下载了之前VQAv2数据集和您这篇文章中的VQA-VS数据集，question标注信息中都只包含了{question_id, image_id, question}，而answer标注信息中也只包含了{question_type, multiple_choice_answer, image_id, answer_type, question_id}信息，没有看到哪里有说每张image中包含了哪些object呀

—Reply to this email directly, view it on GitHub, or unsubscribe.You are receiving this because you commented.Message ID: @.> [ { @.": "http://schema.org", @.": "EmailMessage", "potentialAction": { @.": "ViewAction", "target": "https://github.com/PhoebusSi/VQA-VS/issues/2#issuecomment-1778402789", "url": "https://github.com/PhoebusSi/VQA-VS/issues/2#issuecomment-1778402789", "name": "View Issue" }, "description": "View this Issue on GitHub", "publisher": { @.***": "Organization", "name": "GitHub", "url": "https://github.com" } } ]

PhoebusSi commented 9 months ago

coco2014 ---- 回复的原邮件 @.>发送日期2023年10月25日 10:31 @.> 抄送人Qingyi @.>@.>主题Re: [PhoebusSi/VQA-VS] 关于跟视觉相关的shortcut统计相关问题 (Issue #2) 感谢您的回复，再麻烦请教下：每张image中包含了哪些object的相关信息包含在VQAv2数据集的标注中是哪个版本的数据集呢。我下载了之前VQAv2数据集和您这篇文章中的VQA-VS数据集，question标注信息中都只包含了{question_id, image_id, question}，而answer标注信息中也只包含了{question_type, multiple_choice_answer, image_id, answer_type, question_id}信息，没有看到哪里有说每张image中包含了哪些object呀

—Reply to this email directly, view it on GitHub, or unsubscribe.You are receiving this because you commented.Message ID: @.> [ { @.": "http://schema.org", @.": "EmailMessage", "potentialAction": { @.": "ViewAction", "target": "https://github.com/PhoebusSi/VQA-VS/issues/2#issuecomment-1778402789", "url": "https://github.com/PhoebusSi/VQA-VS/issues/2#issuecomment-1778402789", "name": "View Issue" }, "description": "View this Issue on GitHub", "publisher": { @.***": "Organization", "name": "GitHub", "url": "https://github.com" } } ]

dggeed commented 9 months ago

COCO这篇文章我看了下的，文中说的是包含91种类型的object，然后我看您文章中写的“81种object来自VQAv2数据集提供的标注”，所以，我感觉VQAv2数据集是不是并没有用到COCO数据集中的image全集，但是又不是很清楚您写的由VQAv2中包含的81种object标注信息是哪个版本的VQAv2

dggeed commented 9 months ago

还有就是我看您文章中利用公式9去计算object与answer之间的互信息的话，那您用的VQAv2数据版本中的数据标注中应该是包含了image中object的标注信息的。所以主要是想请教下，您用的哪个版本的VQAv2数据集呀，因为我目前的工作中也想用到image中包含的object信息，谢谢啦~

PhoebusSi commented 9 months ago

object信息来自于coco2014的标注我记得vqav2只有一个版本吧可以看下他的论文里的数据下载链接然后每个图片的image id在coco2014的annotationa中有对应的object标注～ ---- 回复的原邮件 @.>发送日期2023年10月25日 11:39 @.> 抄送人Qingyi @.>@.>主题Re: [PhoebusSi/VQA-VS] 关于跟视觉相关的shortcut统计相关问题 (Issue #2)

还有就是我看您文章中利用公式9去计算object与answer之间的互信息的话，那您用的VQAv2数据版本中的数据标注中应该是包含了image中object的标注信息的。所以主要是想请教下，您用的哪个版本的VQAv2数据集呀，因为我目前的工作中也想用到image中包含的object信息，谢谢啦~

—Reply to this email directly, view it on GitHub, or unsubscribe.You are receiving this because you commented.Message ID: @.> [ { @.": "http://schema.org", @.": "EmailMessage", "potentialAction": { @.": "ViewAction", "target": "https://github.com/PhoebusSi/VQA-VS/issues/2#issuecomment-1778455264", "url": "https://github.com/PhoebusSi/VQA-VS/issues/2#issuecomment-1778455264", "name": "View Issue" }, "description": "View this Issue on GitHub", "publisher": { @.***": "Organization", "name": "GitHub", "url": "https://github.com" } } ]

dggeed commented 9 months ago

好的，非常感谢您的回复，我去看看~