BenjaminFiltjens / MS-GCN

Skeleton-based action segmentation with multi-stage spatial-temporal graph convolutional neural networks (MS-GCN)
32 stars 5 forks source link

About read_frame.py files #7

Closed Forgetmmmm closed 2 years ago

Forgetmmmm commented 2 years ago

Hello,I would like to ask if the input of read_frame.py file is the video, and the output is the joint data of the people in the video. What does the first dimension 6 of the downloaded feature file represent?

BenjaminFiltjens commented 2 years ago

Hi, As datasets typically come in different formats, we provide just a rough example of data preprocessing. The assumptions for the provided example are mentioned in the readme (i.e., input is a c3d format). We use c3d in our example because it is a common and standardized data format common in biomechanics, whereas for video there are already many useful repositories (e.g., you can get inspiration from mmskeleton (https://github.com/open-mmlab/mmskeleton)). Roughly, the preprocessing for video will be:

Now you also know what the first dimension is: we use 6 features per node for all datasets.