ppwwyyxx / speaker-recognition

A Speaker Recognition System
Apache License 2.0
675 stars 275 forks source link

Only one speaker is predicted. #41

Open w4-hslee opened 7 years ago

w4-hslee commented 7 years ago

Hi, Yuxin Wu. Thank you for publishing a cool library.

Unfortunately, I faced a problem while testing to use it. The test environment is as follows. Four voices were added to the learning set, and each voice consists of two to four files. and Voices are registered through the command, "speaker-recognition.py -t enroll -i "./voice/*" -m model.out" enrollment was successfully completed.

Then, when prediction was performed with unused file from the test process, the following results were obtained.

./voice/test/hoseok/hoseok.wav -> seonyoung [failed] ./voice/test/christi/christi.wav -> seonyoung [failed] ./voice/test/seongjun/seongjun.wav -> seonyoung [failed] ./voice/test/seonyoung/seonyoung.wav -> seonyoung ./voice/test/ziye/ziye.wav -> seonyoung [failed]

The results of all test sets appear to be the same. One suspicious point is that seonyoung's learning data was the longest. Do you have any idea what is the problem?

richardm47 commented 7 years ago

@w4-hslee , How accurate are the results for you ? What is the length of samples used for training and length of sample for used for prediction ?