How to use madmom to detect onset of human speech in any audio file?

CPJKU / madmom

Python audio and music signal processing library

Other

1.35k stars 206 forks source link

Not out of the box. There are several different approaches on how to detect/separate speech from non-speech, but none of them are integrated in madmom (yet). You could have a look at the works of my (former) colleagues Jan Schlüter and Reinhard Sonnleitner presented at DAFx 2012. There are plenty of others of course, but these are the first that come to my mind. The latter feature should be easy to implement, an (inefficient) implementation of the correlation part is already in features.onsets.

HTH

CPJKU / madmom

How to use madmom to detect onset of human speech in any audio file? #347