mira-space / MiraData

Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"
https://mira-space.github.io/
GNU General Public License v3.0

Original Scenario #3

Closed: zansun-nusnaz closed this issue 4 months ago

zansun-nusnaz commented 4 months ago

As far as I can tell, there is no metadata for the "scenario" of each clip/video. It would be really nice to have that information available. Could you add it to the metadata or provide another table to match video_id with scenario?

zansun-nusnaz commented 4 months ago

Never mind, I just noticed this:

To download the videos and split the videos into clips, you can use the following scripts:

python download_data.py --meta_csv miradata_v0.csv --video_start_id 0 --video_end_id 10631 --raw_video_save_dir miradata/raw_video --clip_video_save_dir miradata/clip_video

where --video_start_id and --video_end_id indicate the start and end values of download_idx in the meta file for downloading. The gaming scenario ranges from 0 to 7416, and the city/scenic exploration scenario ranges from 7417 to 10631.
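
For anyone else who needs a scenario label per video, the ranges quoted above can be turned into a small lookup. This is only a sketch, not something shipped with the repo: the function name scenario_for_download_idx and the SCENARIO_RANGES table are hypothetical, and it assumes both ends of each range are inclusive, as the wording suggests.

```python
# Ranges quoted from the download instructions above.
# Assumption: both the start and end of each range are inclusive.
SCENARIO_RANGES = {
    "gaming": (0, 7416),
    "city/scenic exploration": (7417, 10631),
}


def scenario_for_download_idx(download_idx: int) -> str:
    """Return the scenario label for a download_idx (hypothetical helper)."""
    for scenario, (start, end) in SCENARIO_RANGES.items():
        if start <= download_idx <= end:
            return scenario
    raise ValueError(f"download_idx {download_idx} is outside the known ranges")


if __name__ == "__main__":
    for idx in (0, 7416, 7417, 10631):
        print(idx, scenario_for_download_idx(idx))
```

The same ranges could be passed as --video_start_id and --video_end_id to download only one scenario, for example 7417 and 10631 for city/scenic exploration.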