encs-humanoid / speech-and-hearing

speech and hearing systems for the IEEE ENCS Humanoid Robot Project

Improve voice activity detection #2

Open danielmcd opened 8 years ago

danielmcd commented 8 years ago

Integrate an improved algorithm for voice activity detection into listen_node.py.

The current voice activity detection relies solely on the sound intensity measured during an audio capture window. A more sophisticated algorithm could make detection more robust.

This issue proposes to introduce the Moattar and Homayounpour algorithm [1], which its authors describe as a simple but efficient real-time voice activity detection algorithm. A Python implementation is available on GitHub [2], but it operates on a file rather than an audio stream. The task is to adapt that implementation to the audio processing model of listen_node.py.

[1] http://www.eurasip.org/Proceedings/Eusipco/Eusipco2009/contents/papers/1569192958.pdf
[2] https://github.com/shriphani/Listener
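For orientation, here is a rough sketch of the per-frame decision described in [1]: each frame's short-term energy, dominant frequency, and spectral flatness are compared against minima tracked over an initial silence window, and the frame is labeled speech when more than one feature exceeds its threshold. This is a simplified approximation, not the referenced implementation or listen_node.py code: the `StreamVAD` name is hypothetical, the feature definitions are simplified, and the paper's post-smoothing step is omitted.

```python
import numpy as np

class StreamVAD:
    """Per-frame voice activity decision, loosely following Moattar &
    Homayounpour. Feed frames one at a time, as an audio-stream node
    would; the first `calib_frames` frames are assumed to be silence."""

    def __init__(self, sample_rate=16000, calib_frames=30,
                 e_thresh=40.0, f_thresh=185.0, sf_thresh=5.0):
        self.sample_rate = sample_rate
        self.calib_frames = calib_frames      # leading frames assumed silent
        self.e_thresh = e_thresh              # primary thresholds from the paper
        self.f_thresh = f_thresh
        self.sf_thresh = sf_thresh
        self.min_e = self.min_f = self.min_sf = None
        self.n_frames = 0
        self.n_silent = 0

    def _features(self, frame):
        spectrum = np.abs(np.fft.rfft(frame)) + 1e-10   # epsilon avoids log(0)
        energy = float(np.sum(np.asarray(frame, dtype=np.float64) ** 2))
        freqs = np.fft.rfftfreq(len(frame), d=1.0 / self.sample_rate)
        dom_freq = float(freqs[int(np.argmax(spectrum))])
        # deviation from spectral flatness: geometric vs. arithmetic mean, in dB
        sfm = -10.0 * np.log10(np.exp(np.mean(np.log(spectrum))) / np.mean(spectrum))
        return energy, dom_freq, sfm

    def is_speech(self, frame):
        energy, dom_freq, sfm = self._features(frame)
        self.n_frames += 1
        if self.n_frames <= self.calib_frames:  # calibration: track silence minima
            self.min_e = energy if self.min_e is None else min(self.min_e, energy)
            self.min_f = dom_freq if self.min_f is None else min(self.min_f, dom_freq)
            self.min_sf = sfm if self.min_sf is None else min(self.min_sf, sfm)
            return False
        counter = 0
        if energy - self.min_e >= self.e_thresh * np.log10(self.min_e):
            counter += 1
        if dom_freq - self.min_f >= self.f_thresh:
            counter += 1
        if sfm - self.min_sf >= self.sf_thresh:
            counter += 1
        if counter > 1:                         # speech needs two features to fire
            return True
        # silent frame: adapt the energy floor, as the paper does
        self.n_silent += 1
        self.min_e = (self.n_silent * self.min_e + energy) / (self.n_silent + 1)
        return False
```

The stateful class shape is the main point: listen_node.py can construct one detector and call `is_speech()` on each captured frame, instead of the file-at-once processing in [2].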

danielmcd commented 8 years ago

It's not integrated yet, but I made some progress on implementing the VAD algorithm mentioned above. At the moment I just have a test program that uses the algorithm to print an indication of voiced and silent frames. More work is needed to validate that the algorithm behaves correctly and to find parameters that hold up in a variety of audio environments. I'm pretty sure it will work better than an intensity threshold alone, but that needs to be verified too.
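One thing that may help with validating the raw voiced/silent output: [1] smooths its per-frame decisions by ignoring very short speech and silence runs before the labels are used. A small sketch of such a smoothing pass (the run-length minimums follow the paper's suggested frame counts; the function names and everything else here are assumptions, not code from the test program):

```python
def _runs(labels):
    """Collapse a boolean label sequence into [label, run_length] pairs."""
    runs = []
    for lab in labels:
        if runs and runs[-1][0] == lab:
            runs[-1][1] += 1
        else:
            runs.append([lab, 1])
    return runs

def _flatten(runs):
    return [lab for lab, n in runs for _ in range(n)]

def smooth_labels(labels, min_speech=5, min_silence=10):
    """Fill short silence gaps between speech runs, then drop speech
    bursts shorter than min_speech frames."""
    runs = _runs(labels)
    # interior silence gaps shorter than min_silence become speech
    filled = [[True, n] if (not lab and n < min_silence and 0 < i < len(runs) - 1)
              else [lab, n] for i, (lab, n) in enumerate(runs)]
    runs = _runs(_flatten(filled))            # re-merge adjacent speech runs
    # isolated speech bursts shorter than min_speech become silence
    dropped = [[False, n] if (lab and n < min_speech) else [lab, n]
               for lab, n in runs]
    return _flatten(dropped)
```

Comparing smoothed labels against a hand-marked recording should make it easier to judge the frame-level accuracy than eyeballing the raw printout.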