whwu95 / Text4Vis

【AAAI'2023 & IJCV】Transferring Vision-Language Models for Visual Recognition: A Classifier Perspective
MIT License

Activitynet dataset #8

Closed: wangyangchen closed this issue 1 year ago

wangyangchen commented 1 year ago

Could you tell me how you extract keyframes from the ActivityNet dataset, and what rules you follow?

whwu95 commented 1 year ago

Hi, thank you for your interest in our work. To extract keyframes from the ActivityNet dataset, we use ffmpeg directly and sample frames at a rate of 1 frame per second (1 FPS).
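For readers who want to reproduce this, here is a minimal sketch of what 1 FPS extraction with ffmpeg might look like when driven from Python. Only the use of ffmpeg and the 1 FPS rate come from the reply above. The helper names `build_ffmpeg_cmd` and `extract_frames`, the `img_%05d.jpg` file-name pattern, and the `-q:v 2` JPEG quality setting are assumptions made for illustration, and may differ from what the repository actually uses.

```python
import subprocess
from pathlib import Path


def build_ffmpeg_cmd(video_path, out_dir, fps=1):
    """Build an ffmpeg command that samples `fps` frames per second.

    Hypothetical helper: the output naming and quality flag are assumptions,
    not the repository's confirmed settings.
    """
    # Assumed naming scheme: img_00001.jpg, img_00002.jpg, ...
    pattern = str(Path(out_dir) / "img_%05d.jpg")
    return [
        "ffmpeg",
        "-i", str(video_path),   # input video
        "-vf", f"fps={fps}",     # fps video filter: resample to `fps` frames per second
        "-q:v", "2",             # assumed JPEG quality (lower value = higher quality)
        pattern,
    ]


def extract_frames(video_path, out_dir, fps=1):
    """Create `out_dir` if needed and run ffmpeg; requires ffmpeg on PATH."""
    Path(out_dir).mkdir(parents=True, exist_ok=True)
    subprocess.run(build_ffmpeg_cmd(video_path, out_dir, fps), check=True)
```

Calling `extract_frames("v_example.mp4", "frames/v_example")` (both paths are placeholders) would write roughly one JPEG per second of video into the output directory.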

wangyangchen commented 1 year ago

Thanks for your reply!