Open WhXmURandom opened 1 year ago
We do not do data augmentation nor any post-processing of the speaker embedding.
What specifically does "post-processing of the speaker embedding" refer to? Are you implying that you did not perform any score normalization or calibration when calculating the EER?
Yes, I did not do any score normalization or calibration.
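For readers following along, here is a minimal sketch of what computing the EER directly on raw scores, with no score normalization or calibration step, could look like. It is not the evaluation code from this repository: the helper names `cosine_score` and `compute_eer` and the toy trial list are hypothetical, and tied scores are not handled specially.

```python
import numpy as np

def cosine_score(emb_a, emb_b):
    """Raw cosine similarity between two speaker embeddings (no post-processing)."""
    emb_a = np.asarray(emb_a, dtype=float)
    emb_b = np.asarray(emb_b, dtype=float)
    return float(np.dot(emb_a, emb_b) / (np.linalg.norm(emb_a) * np.linalg.norm(emb_b)))

def compute_eer(scores, labels):
    """Equal error rate from raw trial scores.

    scores: higher means "more likely the same speaker".
    labels: 1 for target (same-speaker) trials, 0 for non-target trials.
    """
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels, dtype=int)

    # Sort trials from highest to lowest score, then sweep the threshold
    # so that the top-k trials are accepted for k = 0..N.
    order = np.argsort(-scores)
    labels = labels[order]
    n_target = labels.sum()
    n_nontarget = len(labels) - n_target

    false_accepts = np.cumsum(1 - labels)           # non-targets accepted so far
    false_rejects = n_target - np.cumsum(labels)    # targets still rejected

    # Prepend the k = 0 operating point (nothing accepted).
    far = np.concatenate([[0.0], false_accepts / n_nontarget])
    frr = np.concatenate([[1.0], false_rejects / n_target])

    # The EER is taken where the two error rates are closest.
    idx = np.argmin(np.abs(far - frr))
    return float((far[idx] + frr[idx]) / 2)

# Toy trials (made-up numbers, for illustration only).
toy_scores = [0.9, 0.8, 0.7, 0.6]
toy_labels = [1, 0, 1, 0]
print(f"EER: {compute_eer(toy_scores, toy_labels):.2%}")
```

Score normalization or calibration, if used, would transform `scores` before this step; the sketch above skips that entirely.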
Thank you very much for your answer!
Hi, I have a question and I hope you can answer it. Generally speaking, the EER of ECAPA trained on Vox2 and tested on Vox1-o is around 1%. Why is your result 2.91%?