I'm not working on this project anymore. I advise everyone curious about voice detection to have a look at some more modern approaches using deep learning, like:
Python code to apply voice activity detector to wave file. Voice activity detector based on ration between energy in speech band and total energy.
Input audio data treated as following:
Create object:
Example python script which saves speech intervals in json file:
./detectVoiceInWave.py ./wav-sample.wav ./results.json
Example python code to plot detected speech regions:
from vad import VoiceActivityDetector
filename = '/Users/user/wav-sample.wav'
v = VoiceActivityDetector(filename)
v.plot_detected_speech_regions()
Alexander USOLTSEV 2015 (c) MIT License