Open kdu4108 opened 4 months ago
Finished by https://github.com/swiss-ai/ml-4m/pull/17
One thing we overlooked: we actually want a directory layout of the form
root/video_rgb/train/*.tar
root/video_rgb/val/*.tar
root/video_rgb/test/*.tar
So we should modify the script that goes from raw to video_rgb to do this train/val/test split as well.
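A minimal sketch of what that split step could look like, assuming the raw-to-video_rgb script has already written flat *.tar shards into root/video_rgb/ and that splitting at the shard level is acceptable. The function names (assign_splits, split_modality_dir), the 10%/10% val/test fractions, and the seed are all hypothetical and chosen for illustration; this is not necessarily how the linked PR implements it.

```python
import random
import shutil
from pathlib import Path


def assign_splits(shards, val_frac=0.1, test_frac=0.1, seed=0):
    """Map each shard name to 'train', 'val', or 'test' (shard-level split)."""
    shards = sorted(shards)  # sort first so the shuffle is reproducible
    random.Random(seed).shuffle(shards)
    n_val = int(len(shards) * val_frac)
    n_test = int(len(shards) * test_frac)
    splits = {}
    for i, name in enumerate(shards):
        if i < n_val:
            splits[name] = "val"
        elif i < n_val + n_test:
            splits[name] = "test"
        else:
            splits[name] = "train"
    return splits


def split_modality_dir(modality_dir, val_frac=0.1, test_frac=0.1, seed=0):
    """Move modality_dir/*.tar into modality_dir/{train,val,test}/."""
    modality_dir = Path(modality_dir)
    tars = {p.name: p for p in modality_dir.glob("*.tar")}
    splits = assign_splits(tars, val_frac, test_frac, seed)
    for name, split in splits.items():
        dest = modality_dir / split
        dest.mkdir(parents=True, exist_ok=True)
        shutil.move(str(tars[name]), str(dest / name))
    return splits
```

Calling split_modality_dir("root/video_rgb") would then leave the shards under root/video_rgb/train, root/video_rgb/val, and root/video_rgb/test. Because the assignment depends only on the sorted shard names and the seed, re-running it on the same set of shards yields the same split.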
Goal: given v2d format of
produce a video_rgb/ modality data folder of the following format:
Option 1: This should mostly just involve extracting the mp4/video files from the video2dataset format and moving them into the right directory paths.
Option 2: We can use v2d now to normalize the videos, e.g. by giving them all the same number of frames.
We choose option #2 because by the time something lands in a modality folder, it should already have gone through the last preprocessing step before pseudolabeling for aligned data.
Child issue of #3.