albanie / collaborative-experts

Video embeddings for retrieval with natural language queries
https://www.robots.ox.ac.uk/~vgg/research/collaborative-experts/
Apache License 2.0
332 stars 55 forks source link

About raw text data #36

Closed ezeli closed 2 years ago

ezeli commented 2 years ago

Hi, thank you so much for such an excellent job! Can you provide the raw text data (including objects, actions, speech, OCR, etc.) extracted from the MSRVTT dataset. Because here seems to only contain feature data. Thanks again!

Tortoise17 commented 2 years ago

raw text is in pkl format available with each dataset. with MSRVTT also. See raw captions tile.

ezeli commented 2 years ago

raw-captions.pkl file contains only captions, no text data such as objects, actions, speech, OCR, etc.

ioanacroi commented 2 years ago

Hi,

We use only the feature data that is provided online. We do not use any raw text data for the used experts.

Cheers, Ioana

ezeli commented 2 years ago

OK, thanks!