Open predestination opened 5 years ago
Hi, if you expect most of your input is noisy or is unique in certain ways (like speaker accent, background noise), then it's better to train the custom acoustic model with this type of audio. The IBM Debater uses this approach and was able to reduce the error rate to ~5%. If you have a few audio clips and want to do noise reduction, I did a quick search and saw a few options:
Thank you for the reply, I tried noisereduce and logmmse earllier but it didn't improve the transcript quality. Will check the scipy signal.
Hey, what are the possible Audio Pre-processing steps that can be used to improve transcript quality? Is there any library in python for denoising or audio enhancement without using deep learning ( as it is taking lot of time for a small audio clip). ?