Hello,Thanks for sharing the data. Could you please tell me the method used to generate the video captions within the WebVid dataset. Please provide some insights into whether the captions were:
Generated by a deep learning model, and if so, which model was used?
Manually written by human annotators, and if so, what guidelines were they provided with?
Scraped from the internet, and if so, what was the process for ensuring the relevance and quality of the captions?
Hello,Thanks for sharing the data. Could you please tell me the method used to generate the video captions within the WebVid dataset. Please provide some insights into whether the captions were: