cvdfoundation / kinetics-dataset

Videos inside train and test folder may have different names. (K400) #39

Open hamzakarim94 opened 2 years ago

hamzakarim94 commented 2 years ago

The following is the annotations file for training:

The following is the list of videos present inside the train folder: origtrain.txt

Many videos that are present in the annotations file are either missing or have a different name. Some examples:

absent: ['abseiling' 'lqciwm6gDrk' 659 669 'train' 0]
absent: ['abseiling' 'Lwti_IVm-Bc' 39 49 'train' 0]
absent: ['abseiling' 'LwyKxe85UWI' 88 98 'train' 0]
absent: ['abseiling' 'lXnebafO2cI' 2145 2155 'train' 0]
absent: ['abseiling' 'LY02AE6XK5I' 381 391 'train' 0]
absent: ['abseiling' 'M-hBdj62g9Y' 48 58 'train' 0]
absent: ['abseiling' 'm-iKFbNcLYM' 30 40 'train' 0]
absent: ['abseiling' 'M1QFHoC4o3A' 78 88 'train' 0]
absent: ['abseiling' 'm25BcZ3B0Hs' 219 229 'train' 0]
absent: ['abseiling' 'M6yv0dy8lYE' 297 307 'train' 0]
absent: ['abseiling' 'm8Pm5kmCuqI' 64 74 'train' 0]
absent: ['abseiling' 'MIIbU2xZcUY' 32 42 'train' 0]
absent: ['abseiling' 'mjsrWa2olhk' 35 45 'train' 0]
absent: ['abseiling' 'MP-Op52e84g' 176 186 'train' 0]
absent: ['abseiling' 'MqBaIW3qmuM' 98 108 'train' 0]
absent: ['abseiling' 'mRdyYMPlJ_8' 73 83 'train' 0]

Can someone please confirm whether they have the same issue, or am I missing something? Thank you.
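
For anyone who wants to reproduce this check, here is a minimal sketch. It assumes that clips are named `<youtube_id>_<time_start>_<time_end>.mp4` with zero-padded six-digit times (this matches filenames such as `tYGO9QCQfMU_000015_000025.mp4` reported in this thread, but is not confirmed by the maintainers) and that each annotation row exposes `youtube_id`, `time_start`, and `time_end` fields. The helper names `expected_filename` and `find_absent` are hypothetical.

```python
def expected_filename(youtube_id, time_start, time_end):
    """Build the clip filename expected for one annotation row.

    Assumption: times are zero-padded to six digits, as in
    tYGO9QCQfMU_000015_000025.mp4.
    """
    return f"{youtube_id}_{int(time_start):06d}_{int(time_end):06d}.mp4"


def find_absent(annotation_rows, present_filenames):
    """Return the annotation rows whose expected file is not in the folder listing.

    annotation_rows: iterable of dicts with youtube_id, time_start, time_end
    (for example, rows produced by csv.DictReader).
    present_filenames: iterable of bare filenames found in the train folder.
    """
    present = set(present_filenames)
    return [
        row
        for row in annotation_rows
        if expected_filename(row["youtube_id"], row["time_start"], row["time_end"])
        not in present
    ]
```

The comparison is case-sensitive, so an ID that differs from the filename only in letter case is reported as absent.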

zanedurante commented 2 years ago

Were you able to resolve this? I am not sure exactly how the IDs in the annotation CSV file relate to the downloaded videos. I also found a problem going the other way: certain downloaded videos have no corresponding entry in the annotation file. For example, I couldn't find a label for these videos:

train/tYGO9QCQfMU_000015_000025.mp4
train/mEIeT8TcwVU_000000_000010.mp4
train/GPNkUmpic9c_000024_000034.mp4
train/qF3VV-vscPA_000015_000025.mp4
train/xlQ1FqRZw18_000084_000094.mp4
train/I2qDBDC4qi0_000003_000013.mp4
train/e4JHEHu5BKo_000260_000270.mp4
train/Q1DVNWCswGk_000166_000176.mp4
...

To be concrete, there is no entry in the provided train.csv for tYGO9QCQfMU or any of the other files I listed.
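
A rough sketch of the reverse check (downloaded files with no annotation entry) follows. It assumes the `<youtube_id>_<start>_<end>.mp4` naming seen in the paths above. The ID is split off from the right, since IDs can themselves contain underscores (for example `mRdyYMPlJ_8`). The function names are hypothetical.

```python
from pathlib import PurePosixPath


def parse_clip_name(path):
    """Split 'train/<id>_<start>_<end>.mp4' into (id, start, end).

    rsplit from the right keeps underscores inside the ID intact.
    """
    stem = PurePosixPath(path).stem
    youtube_id, start, end = stem.rsplit("_", 2)
    return youtube_id, int(start), int(end)


def files_without_annotation(paths, annotated_ids):
    """Return the paths whose parsed ID does not appear in annotated_ids."""
    annotated = set(annotated_ids)
    return [p for p in paths if parse_clip_name(p)[0] not in annotated]
```

Here `annotated_ids` would be the `youtube_id` column read from train.csv; the match is exact and case-sensitive.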

shngt commented 1 year ago

Following up since I'm having the same issue - were either of you able to resolve this? It seems impossible to figure out what the class of each video is.
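
For files that do have an annotation row, the class can in principle be looked up by clip ID and time range. This is only a sketch under two assumptions: the `<youtube_id>_<start>_<end>.mp4` naming seen earlier in this thread, and annotation rows carrying `label`, `youtube_id`, `time_start`, and `time_end` fields. `build_label_index` and `label_for_file` are hypothetical names. Files with no matching row return `None`, so they can be counted or set aside.

```python
from pathlib import PurePosixPath


def build_label_index(annotation_rows):
    """Map (youtube_id, start, end) to the class label of that clip."""
    return {
        (row["youtube_id"], int(row["time_start"]), int(row["time_end"])): row["label"]
        for row in annotation_rows
    }


def label_for_file(path, index):
    """Return the label for a clip path, or None if no annotation row matches."""
    stem = PurePosixPath(path).stem
    # Split from the right: IDs may contain underscores.
    youtube_id, start, end = stem.rsplit("_", 2)
    return index.get((youtube_id, int(start), int(end)))
```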