Open ErfolgreichCharismatisch opened 7 years ago
Well, in theory you'd go about removing them the same way as anything else: collect as large a sample as possible of these undesirable noises and train a KNN or SVM (or whatever) model to classify their occurrence. Then you can use the mtFileClassification function to find when they occur in your test data, and you could then crop them out.
Let's say, you say a lot of
um...
,er...
and dobreathing
andsmacking
/slurping
noises and say filling words likereally
. How would you go about removing them with pyAudioAnalysis in audio?