jcsilva / docker-kaldi-gstreamer-server

Dockerfile for kaldi-gstreamer-server.
BSD 2-Clause "Simplified" License
287 stars 139 forks source link

Is there any solution for the low quality audio? #13

Closed wang850228803 closed 7 years ago

wang850228803 commented 7 years ago

Is there any solution for the low quality audio?

jcsilva commented 7 years ago

I didn't get what do you mean by low quality audio. Could you please be a bit more specific? And could you provide some examples?

wang850228803 commented 7 years ago

This is an example.

description09.aac.zip

jcsilva commented 7 years ago

Well,

you do have some problems with this sample.

1)It seems to be encoded/decoded by some low bitrate codec. This codec introduces some distortions and I think that if your acoustic model was not trained with some similar distortion you will have poor transcriptions.

2) Even though your data was sampled at 48000 Hz, your spectrogram shows a bandwith of just 6 ~7 kHz. I think it was done by your codec. The problem again is that if your acoustic model was not trained using the similar "low quality audio", you will have problems in your transcriptions