mravanelli / SincNet

SincNet is a neural architecture for efficiently processing raw audio samples.
MIT License
1.11k stars 260 forks source link

Performance on speaker identification is good ,but on speaker verification is poor #68

Open zrtlemontree opened 4 years ago

zrtlemontree commented 4 years ago

I'm using your model to train on TIMIT, my parameters are the same as yours ,such as lr、batchsize. I use the script named  ‘speaker_id.py ’ to train model, and ‘compute_d_vector.py’ to get embeddings of utterances. In my experiment, I use the cosine similarity to get score,and the performance on speaker identification is good (like your result opening in github ),but on speaker verification I cannot reach my expectations(so far my EER is 3%).So I want to know your EER on TIMIT(verification). Many thanks.

mravanelli commented 4 years ago

Sorry for my late reply. Which is the EER that you able to achieve?

zrtlemontree commented 4 years ago

Thanks for your reply. My EER is about 2.5%. I use the "speaker_id.py" script and AM-Softmax to train sincnet, use the "compute_d_vector.py" script to extract embedding, and then use the cosine similarity score to calculate the EER. I tried a variety of trials, such as a trial with a correct ratio of 9:1 , and 1:1, but the EER is about 2.5% or more. I can basically reproduce the results of your "speaker_id.py" script.

Thank you.

------------------ 原始邮件 ------------------ 发件人: "Mirco Ravanelli"<notifications@github.com>; 发送时间: 2019年11月16日(星期六) 晚上10:59 收件人: "mravanelli/SincNet"<SincNet@noreply.github.com>; 抄送: "张瑞腾"<570836446@qq.com>;"Author"<author@noreply.github.com>; 主题: Re: [mravanelli/SincNet] Performance on speaker identification is good ,but on speaker verification is poor (#68)

Sorry for my late reply. Which is the EER that you able to achive?

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub, or unsubscribe.

tommyngx commented 3 years ago

@zrtlemontree Can you please share and help me with verification task? after get d-vector, i am confused to do that? please help! if possible, could you show the code you did for the verification task? thank you very much! my email is tommyfederation@gmail.com . Cheers

Shrijoy-Chowdhury commented 2 years ago

Hi I have trained the model using speaker-=_id.py script.Next I want to test my model on new dataset, can anyone help me out with it. I am not able to understand how to do it