Thanks for your great work. I am confused about one thing in the preprocessing stage. When we extract images, landmarks, and audio features from a video, do we need to end up with the same number of files for each? I got different counts: for example, 2247 images and 2247 landmarks, but only 937 audio feature files. Could someone please clarify this?