phonexiaresearch / VBx-training-recipe

Other
29 stars 11 forks source link

About the training data #4

Closed mycrazycracy closed 3 years ago

mycrazycracy commented 3 years ago

Hi,

Thanks for the great code. I have one question when I read through the code.

In train.sh, it claims that the total number of speakers in the training set is 8199. I'm not sure how this number comes. In run.sh, the training data contains voxceleb 1, voxceleb 2 dev and voxceleb-cn. VoxCeleb1 contains 1211/40 speakers in the dev/test set. VoxCeleb2 contains 5994/118 speakers in dev/test set. But what is voxceleb-cn? Is it cn-celeb which contains 800/200 speakers in the dev/test set?

Any advice would be appreciated! Thank you!

Jamiroquai88 commented 3 years ago

Hi,

Jamiroquai88 commented 3 years ago

@MichalKlco

mycrazycracy commented 3 years ago

Thank you for the useful information! We often call voxceleb-cn as "CN-Celeb" :-P