Performance on short speech

HPI-DeepLearning / crnn-lid

Code for the paper Language Identification Using Deep Convolutional Recurrent Neural Networks

GNU General Public License v3.0

104 stars 48 forks source link

Hi there, first, thanks for the toolkit.

I am interested in applying this on short audios. I did a simple test by chopping the web-server/audio/samples audios into 10 seconds segments and ran predict.py separately on these segments with the existing model from web-server folder (assuming this model would be the best;)). When predicting them separately, the accuracy seemed quite low, about 60%. More similar tests with our own dataset received worse results... I understand short audio would be much tougher, but I still wonder if you'd have any insights if we can improve this. Thanks in advance.

Ben

HPI-DeepLearning / crnn-lid

Performance on short speech #12