sstzal / DiffTalk

[CVPR2023] The implementation for "DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation"
441 stars 41 forks source link

How did you split dataset for training and validation? #18

Open quqixun opened 1 year ago

quqixun commented 1 year ago

In data dir, there are data_test.txt for validation and data_train.txt for training. How did you split dataset? By portrait or by videos? By portrait means persons in training set are not repeated in validation set. By videos means randomly spit all videos into training and validation set.

In train_name.txt, there are 98 videos. However, 99 videos can be found in data_train.txt. What is the relationship between train_name.txt and data_train.txt?

Could you provide test_name.txt just like train_name.txt to indicate videos used in validation?