exitudio closed this issue 1 year ago
Hi, thanks for your interest in our work. First, training on full-length motions in parallel requires an additional strategy such as padding, and padding every motion to the maximum motion length is time-consuming during training. Second, CNNs generalize well across sequence lengths, so training on 64-frame windows is enough to reconstruct motions of other lengths.
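For readers who want to see the trade-off concretely, here is a minimal sketch of fixed-window cropping compared with padding. It is not the repository's actual data loader: the function names (`random_window`, `make_batch`, `padded_frame_count`), the sample motion lengths, and the 3-feature frames are all hypothetical, and only the window size of 64 comes from the reply above.

```python
import random

WINDOW_SIZE = 64  # frames per training crop, as stated in the reply above


def random_window(motion, window_size=WINDOW_SIZE, rng=random):
    """Return a random contiguous crop of `window_size` frames.

    `motion` is a list of frames (each frame is a list of floats).
    Motions shorter than the window are rejected here; how the real
    code handles them is not stated in this thread.
    """
    if len(motion) < window_size:
        raise ValueError("motion is shorter than the crop window")
    start = rng.randint(0, len(motion) - window_size)  # both bounds inclusive
    return motion[start:start + window_size]


def make_batch(motions, window_size=WINDOW_SIZE, rng=random):
    """Crop every motion to the same length so the batch is rectangular."""
    return [random_window(m, window_size, rng) for m in motions]


def padded_frame_count(motions):
    """Frames per batch if every motion were padded to the longest one."""
    return len(motions) * max(len(m) for m in motions)


# Hypothetical motions of 80, 120 and 196 frames with 3 features per frame.
rng = random.Random(0)
motions = [[[0.0] * 3 for _ in range(n)] for n in (80, 120, 196)]

batch = make_batch(motions, rng=rng)
cropped_frames = sum(len(clip) for clip in batch)  # 3 clips * 64 frames
padded_frames = padded_frame_count(motions)        # 3 clips * 196 frames
```

With these assumed lengths, cropping feeds 192 frames per batch where padding to the longest motion would feed 588, and no padding mask is needed because every clip already has the same length.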
Thank you for the insightful information.
Thank you for your amazing work. I wonder why the first-stage training (VAE) uses only 64 frames rather than the whole sequence.