This library detects a hotword in a given audio file using MFCC features and the Dynamic Time Warping (DTW) pattern-matching algorithm.
This project is on `PyPI <https://pypi.python.org/pypi/hotword_detection/1.2>`_.

To install from PyPI::

    pip install hotword_detection

From this repository::

    git clone https://github.com/sakethgsharma/HotWordDetection.git
    cd HotWordDetection
    python setup.py install

Example scripts
For training a hotword, run::

    python bin/trainHotword.py

For testing, run::

    python bin/checkHotword.py

Supported features
This module uses MFCC vectors, which are among the most commonly extracted features in speech recognition systems.

=============  ===========
Parameter      Description
=============  ===========
alpha          Parameter used in pre-emphasis filtering. Should be any value between 0 and 1.
N              Number of FFT points.
fs             Sampling frequency of stored audio file.
frame_dur      Duration of 1 speech frame.
num_filters    Number of filters used in the Mel filterbank.
lower_freq     Lower frequency bound used for constructing filterbank.
upper_freq     Upper frequency bound used for constructing filterbank. Should be less than fs/2.
=============  ===========

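To make the role of these parameters more concrete, here is a minimal sketch of three typical MFCC front-end steps: pre-emphasis, frame length computation, and Mel filterbank edge placement. It is not this library's code. The function names (``pre_emphasis``, ``hz_to_mel``, ``mel_to_hz``, ``filterbank_edges``) and the numeric values are assumptions chosen for illustration, and the Mel formula used is the common ``2595 * log10(1 + f / 700)`` variant.

```python
import math


def pre_emphasis(signal, alpha=0.95):
    """Apply y[n] = x[n] - alpha * x[n-1]; the first sample is passed through."""
    if not signal:
        return []
    return [signal[0]] + [
        signal[n] - alpha * signal[n - 1] for n in range(1, len(signal))
    ]


def hz_to_mel(f):
    """Convert a frequency in Hz to the Mel scale (common 2595/700 variant)."""
    return 2595.0 * math.log10(1.0 + f / 700.0)


def mel_to_hz(m):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def filterbank_edges(num_filters, lower_freq, upper_freq):
    """Return num_filters + 2 edge frequencies in Hz, equally spaced in Mel."""
    lo, hi = hz_to_mel(lower_freq), hz_to_mel(upper_freq)
    step = (hi - lo) / (num_filters + 1)
    return [mel_to_hz(lo + i * step) for i in range(num_filters + 2)]


# Illustrative values only; these are NOT the library's defaults.
fs = 16000          # sampling frequency in Hz
frame_dur = 0.025   # frame duration in seconds
frame_len = int(round(frame_dur * fs))  # samples per frame

# upper_freq is kept below fs / 2, as the table above requires.
edges = filterbank_edges(num_filters=26, lower_freq=300.0, upper_freq=7000.0)
```

Each triangular filter ``k`` would span ``edges[k]`` to ``edges[k + 2]`` with its peak at ``edges[k + 1]``, and these frequencies would then be mapped onto the ``N`` FFT bins.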
Dynamic time warping (DTW) is an algorithm for measuring similarity between two temporal sequences which may vary in speed.
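As an illustration of the idea, the following is a minimal sketch of the classic DTW recurrence over two sequences of feature vectors, using Euclidean distance as the local cost. The function name ``dtw_distance`` and the toy sequences are assumptions for this example and are not taken from the library, which may use a different local cost, step pattern, or normalisation.

```python
import math


def dtw_distance(a, b):
    """Accumulated DTW cost between two sequences of equal-dimension vectors."""
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = minimal accumulated cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            # Euclidean distance between the two frames (local cost)
            d = math.sqrt(sum((x - y) ** 2 for x, y in zip(a[i - 1], b[j - 1])))
            # Extend the cheapest of: insertion, deletion, match
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]


# Toy one-dimensional "feature" sequences (hypothetical data).
template = [[0.0], [1.0], [2.0]]
slow = [[0.0], [0.0], [1.0], [1.0], [2.0], [2.0]]   # same shape, half the speed
reverse = [[2.0], [1.0], [0.0]]                     # different shape

same_shape_cost = dtw_distance(template, slow)
different_shape_cost = dtw_distance(template, reverse)
```

Here ``same_shape_cost`` is zero because the slower sequence can be warped exactly onto the template, while ``different_shape_cost`` is positive. A hotword detector could compare such a cost against a threshold, although how this library makes that decision is not described here.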

`MFCC tutorial <http://www.practicalcryptography.com/miscellaneous/machine-learning/guide-mel-frequency-cepstral-coefficients-mfccs/>`_

`DTW Wiki <http://en.wikipedia.org/wiki/Dynamic_time_warping>`_