carolineec / EverybodyDanceNow

Motion Retargeting Video Subjects

Issues in Dataset Statistics #28

Open hshreeshail opened 2 years ago

hshreeshail commented 2 years ago

I downloaded the dataset from here, as mentioned on the project website. I found some discrepancies between the numbers reported in the paper and what I see in the dataset:

1] In Section 3 (Dataset Collection) of the appendix, the paper mentions that the first 20% of the filmed footage is used for training and the last 80% for testing. However, when I counted the frames (for the 5 subjects) in the downloaded dataset, I got the following numbers:

Subject      train    test    ratio (train/test)
Subject 1    11642    3598    3.23
Subject 2    10623    2794    3.80
Subject 3     9948    1848    5.38
Subject 4    23410    4546    5.15
Subject 5    25214    4998    5.04

Why the mismatch? Is it a typo in the paper?
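As a quick sanity check on the counts above, the following minimal sketch recomputes the train/test ratio and the share of frames that fall in the training split for each subject. The frame counts are copied from this comment; the names `frame_counts` and `split_stats` are illustrative only and are not part of the dataset or the project code.

```python
# Frame counts as reported in this issue: subject -> (train frames, test frames).
frame_counts = {
    "Subject 1": (11642, 3598),
    "Subject 2": (10623, 2794),
    "Subject 3": (9948, 1848),
    "Subject 4": (23410, 4546),
    "Subject 5": (25214, 4998),
}


def split_stats(train, test):
    """Return (train/test ratio, fraction of all frames used for training)."""
    total = train + test
    return train / test, train / total


for subject, (train, test) in frame_counts.items():
    ratio, train_share = split_stats(train, test)
    print(f"{subject}: train/test = {ratio:.2f}, train share = {train_share:.1%}")
```

For reference, a 20%/80% train/test split would give a train/test ratio of 0.25, and an 80%/20% split would give 4.0. The counts above put the training share at roughly 76% to 84% per subject.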

2] In the same section, the paper mentions that each subject was filmed at 120 fps, with each video lasting between 8 and 17 minutes. However, going by the number of frames, the total runtime of all frames (train and test, all 5 subjects) is only 13 min 41 seconds at 120 fps, whereas I would expect at least 40 minutes (5 subjects × 8 minutes). Is only part of the filmed footage used in the final dataset?
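The runtime figure can be reproduced with a short calculation. This is only a sketch: it assumes every frame in the download corresponds to 1/120 s of footage (that is, no temporal subsampling was applied), which the issue does not confirm. The helper name `runtime_min_sec` is hypothetical.

```python
FPS = 120  # capture rate stated in the paper, per this issue

# Per-subject frame counts as reported in this issue (Subjects 1 to 5).
train_frames = [11642, 10623, 9948, 23410, 25214]
test_frames = [3598, 2794, 1848, 4546, 4998]


def runtime_min_sec(frames, fps=FPS):
    """Convert a frame count to (whole minutes, remaining seconds)."""
    minutes, seconds = divmod(frames / fps, 60)
    return int(minutes), seconds


total_frames = sum(train_frames) + sum(test_frames)
minutes, seconds = runtime_min_sec(total_frames)
print(f"total frames: {total_frames}")
print(f"runtime at {FPS} fps: {minutes} min {seconds:.1f} s")

# Expected range if each of the 5 subjects was filmed for 8 to 17 minutes.
num_subjects = len(train_frames)
print(f"expected footage: {num_subjects * 8} to {num_subjects * 17} min")
```

If the released frames were instead sampled at a lower rate than the capture rate, the same frame count would cover proportionally more footage, so the frame rate of the released data is worth confirming.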