cvondrick / soundnet

SoundNet: Learning Sound Representations from Unlabeled Video. NIPS 2016
http://projects.csail.mit.edu/soundnet/
MIT License

Tips on crawling images from video #7

Closed by keunwoochoi 7 years ago

keunwoochoi commented 7 years ago

Hi, I'd like to ask for some tips that are generally applicable to deep learning on video and images; so far I've only worked on music-related tasks. Some (if not all) of these questions might seem dumb ;)

Thanks!

cvondrick commented 7 years ago
  1. I would recommend JPEG.
  2. We extracted 1 frame per second.
  3. Video data is huge, so I recommend designing an efficient way to store it!
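
For anyone landing here later, points 1 and 2 above could be sketched roughly as follows. This is only a minimal illustration, not the authors' actual pipeline: it assumes the `ffmpeg` command-line tool is installed, and the function names, the `frame_%06d.jpg` naming pattern, and the quality setting are hypothetical choices made for this example.

```python
import subprocess
from pathlib import Path


def build_ffmpeg_command(video_path, out_dir, fps=1, jpeg_quality=2):
    """Build an ffmpeg command that samples `fps` frames per second as JPEGs.

    The output naming pattern and default quality are assumptions for this
    sketch, not something specified in the thread.
    """
    out_pattern = str(Path(out_dir) / "frame_%06d.jpg")
    return [
        "ffmpeg",
        "-i", str(video_path),      # input video
        "-vf", f"fps={fps}",        # resample to `fps` frames per second
        "-q:v", str(jpeg_quality),  # JPEG quality scale (lower is better)
        out_pattern,
    ]


def extract_frames(video_path, out_dir, fps=1, jpeg_quality=2):
    """Create `out_dir` if needed and run ffmpeg; requires ffmpeg on PATH."""
    Path(out_dir).mkdir(parents=True, exist_ok=True)
    cmd = build_ffmpeg_command(video_path, out_dir, fps, jpeg_quality)
    subprocess.run(cmd, check=True)
```

Calling `extract_frames("clip.mp4", "frames")` would write one JPEG per second of video into `frames/`. Keeping the command construction separate from the call that runs it makes it easy to inspect or log the command before processing a large batch of videos.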