Hi,
Thanks for the great code. I have one question that came up while reading through it.
In train.sh, the total number of speakers in the training set is given as 8199, but I'm not sure where this number comes from. According to run.sh, the training data consists of VoxCeleb1, VoxCeleb2 dev, and voxceleb-cn. VoxCeleb1 has 1211/40 speakers in its dev/test sets, and VoxCeleb2 has 5994/118 speakers in its dev/test sets. But what is voxceleb-cn? Is it CN-Celeb, which has 800/200 speakers in its dev/test sets?
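To make the mismatch concrete, here is a quick check of the arithmetic using only the speaker counts quoted above. It rests on two assumptions of mine: that only the dev portions of VoxCeleb1 and VoxCeleb2 are used for training, and that "voxceleb-cn" refers to CN-Celeb. The variable names are just for illustration and do not come from the repository.

```python
# Speaker counts as quoted in the question, stored as (dev, test).
counts = {
    "voxceleb1": (1211, 40),
    "voxceleb2": (5994, 118),
    "cn-celeb": (800, 200),  # assumption: "voxceleb-cn" means CN-Celeb
}
claimed_total = 8199  # value stated in train.sh

# Assumption: only the dev portions of VoxCeleb1 and VoxCeleb2 are used.
vox_dev = counts["voxceleb1"][0] + counts["voxceleb2"][0]

# Speakers that the third dataset would have to contribute.
remainder = claimed_total - vox_dev

# Totals if CN-Celeb dev only, or CN-Celeb dev + test, were added.
with_cn_dev = vox_dev + counts["cn-celeb"][0]
with_cn_all = vox_dev + sum(counts["cn-celeb"])

print("VoxCeleb1 dev + VoxCeleb2 dev:", vox_dev)
print("Left over for voxceleb-cn:", remainder)
print("With CN-Celeb dev:", with_cn_dev)
print("With CN-Celeb dev + test:", with_cn_all)
```

Under these assumptions, the third dataset would need to contribute 994 speakers, which matches neither 800 nor 1000, so I may be misreading which subsets are used.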
Any advice would be appreciated! Thank you!