Janghyun1230 / Speaker_Verification

Tensorflow implementation of "Generalized End-to-End Loss for Speaker Verification"
MIT License
354 stars 102 forks source link

the noise_filenum #1

Closed guozhonghao1994 closed 5 years ago

guozhonghao1994 commented 6 years ago

Hi! I have a question on your noise extraction part. In configuration.py you've marked the "noise_filenum" as "how many noise files will you use". In data_preprocess you seem to extract noises from each paired noise-clean data and save them as a *.npy file. In TD-SV train&test, however, you set the "noise_filenum" to default(16), which means that you add just #16 noise to any utterance. So what does "how many noise files will you use" mean? I think it's better to say "the # of noise file you will use". Is my understanding correct?

guozhonghao1994 commented 6 years ago

By the way, I can't see where your cossim() is used...do you have any idea about that?

Janghyun1230 commented 6 years ago

Sorry for late reply. For the first question you're right. Because I could not get public dataset for td-sv (several samples per person with same content) , so I tried to generate them by adding various noise. So "how many noise files will you use" means how many noise type will be used for such dataset.

Janghyun1230 commented 6 years ago

And at the first time, I used cossim() function... however, I think after modification it's not used anymore..