Diananini / LD-ConGR-CVPR2022

A Large RGB-D Video Dataset for Long-Distance Continuous Gesture Recognition
23 stars 2 forks source link

Pre-process videos to frames #5

Closed erwangccc closed 2 years ago

erwangccc commented 2 years ago

@Diananini Hi, sorry to bother you.

You said "All you need to do is extract frames of the with ffmpeg: ffmpeg -i videoPath -r 30 framesPath/%05d.png". And Maybe this command can process video one by one, it's so inconvenient if we process all videos. So could you give some suggestions about how to process videos more efficiently?

Diananini commented 2 years ago

Write a script to traverse all videos and extract frames for each video

erwangccc commented 2 years ago

OK, Thanks

erwangccc commented 2 years ago

Hi, i realize that videos size is about 42GB, But the frames from one video are about 2GB. It's almost 2T after processing all videos, is it normal? If it's normal, I think it's too huge for my device.

Diananini commented 2 years ago

PNG is a lossless compression format, thus requiring a lot of storage space. You can try other lossy compression formats, but this will decrease the accuracy of the model.

erwangccc commented 2 years ago

@Diananini Thanks for your response. I saved frames to 'jpg' data which is a compress format. All videos will generate 90GB data. I also think it will decrease the accuracy. Have you trian the model based on differet format data, such as 'jpg' format. How much will the accuray drop? Thanks

Diananini commented 2 years ago

We haven't trained the model on JPG images. We would appreciate it if you would like to report the experiment results for reference.

erwangccc commented 2 years ago

I'll share the results with you after the training. May i have you wechat please? The communication efficiency here is too low. My WeChat: wangshaochang. Thanks