jcjohnson / densecap

Dense image captioning in Torch
MIT License
1.58k stars 430 forks

What is the ground truth when I use natural language queries to retrieve the source image? #68

Open helloworldwxr opened 7 years ago

helloworldwxr commented 7 years ago

In your paper, the dense captioning model supports image retrieval using natural language queries and can localize those queries in the retrieved images. What is the ground truth when you calculate R@n?
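For context, R@n (recall at n) is usually understood as the fraction of queries whose ground-truth item appears among the top n retrieved results. The sketch below illustrates that generic definition only. It assumes each query has exactly one ground-truth image id, which may or may not match the paper's protocol (that is what this issue asks), and the names `recall_at_n`, `ranked_ids`, and `truth_ids` are hypothetical and not taken from the densecap code.

```python
from typing import Sequence


def recall_at_n(
    ranked_ids: Sequence[Sequence[int]],
    truth_ids: Sequence[int],
    n: int,
) -> float:
    """Fraction of queries whose ground-truth image is in the top-n results.

    ranked_ids[i] is the list of image ids retrieved for query i, best first.
    truth_ids[i] is the single ground-truth image id assumed for query i.
    """
    if len(ranked_ids) != len(truth_ids):
        raise ValueError("ranked_ids and truth_ids must have the same length")
    if not truth_ids:
        return 0.0
    # Count a hit when the assumed ground-truth id is among the first n results.
    hits = sum(
        1
        for ranking, truth in zip(ranked_ids, truth_ids)
        if truth in ranking[:n]
    )
    return hits / len(truth_ids)


# Toy data: three queries over three images (ids are made up for illustration).
rankings = [[3, 1, 2], [2, 3, 1], [1, 2, 3]]
truth = [1, 2, 3]
r_at_1 = recall_at_n(rankings, truth, 1)
r_at_2 = recall_at_n(rankings, truth, 2)
r_at_3 = recall_at_n(rankings, truth, 3)
```

Under this definition the metric depends entirely on how the ground-truth image for each query is chosen, which is why the question above matters.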