add support for time-domain filter for pre-emphasis of high-frequencies for MFCCs

One essential difference between htk's variant of MFCC and other implementations is the preemphasis of high frequencies. This is done by means of a IIR filter (as far as I understand) as explained in chapter 5.4 of the htk book

I tried to use the IIR of essentia to reproduce the MFCCs:

        import essentia.standard as ess
        PREEMPH = 0.97: #  PREEMCOEF = 0.97 in htk
        preemph_filter = ess.IIR(numerator=[1-PREEMPH])

        # startFromZero = True, validFrameThresholdRatio = 1 : the way htk computes windows
        for frame in ess.FrameGenerator(audio, frameSize = frameSize, hopSize = hopSize , startFromZero = True, validFrameThresholdRatio = 1):
                frame_doubled_first = np.insert(frame,0,frame[0])  
                preemph_frame = preemph_filter(frame_doubled_first)
                frame = preemph_frame[1:]

But the resulting MFCCs are not the same with the ones from htk. One can use this repo that has audio examples and htk-extracted mfccs. Once this is solved, the code snippet should be added to the full example

MTG / essentia

add support for time-domain filter for pre-emphasis of high-frequencies for MFCCs #656