astorfi / speechpy

:speech_balloon: SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/
Apache License 2.0
880 stars 104 forks source link

Extracting log mel filterbank features #32

Closed bwang482 closed 5 years ago

bwang482 commented 5 years ago

Thanks very much for the great library! It's my default library for speech processing now.

Just want to double check on the following, I want to extract 40-dimensional log mel filterbank feautres from sliding a Hamming window of width 25ms with an overlap of 10ms. Does the code below extract the right features? I am a bit uncertain whether frame_stride=0.01 creates overlap of 10ms..

fs, signal = wav.read(file_path)
lmfe = speechpy.feature.lmfe(signal, sampling_frequency=fs, frame_length=0.025, frame_stride=0.01, num_filters=40, fft_length=512, low_frequency=0, high_frequency=None)

Thanks!

MichelleYang2017 commented 5 years ago

I suggest that you should read some source code:

frame_stride (float): the step between successive frames in seconds.

So,frame_stride means that no overlap