Open JoaoSartoreto opened 1 month ago
Hey João, thanks for the issue submission! Your plan sounds good. I haven't really seen pre-emphasis filters being used for music applications - do you have any examples from papers? Another thing to consider is delta features (see librosa.feature.delta). 1st and 2nd order deltas of MFCCs are often used in MIR research (you'd simply concatenate the array with the original MFCC with that of the 1st and 2nd order ones).
Keep in mind that the package is currently restricted to simply using a feature name, without the option to configure different options for a feature. It would be nice in the future to be able to call mfcc(pre-emphasis=True, deltas=True)
, but for now if you want to implement a couple different options for MFCC you'd have to use different feature names (e.g. mfcc
, mfcc-deltas
).
Hey Christos,
Thanks for the feedback on the issue submission! The idea of using pre-emphasis filters for music applications was suggested by a friend from college. However, I didn’t have a strong academic background on the topic and was trying to find some papers to support the idea. So far, I haven’t found any relevant examples in the literature.
Given this, I plan to follow the standard implementation approach. I’ll focus on using delta features (librosa.feature.delta) and concatenating the original MFCCs with the 1st and 2nd order ones, as you mentioned.
If you have any additional tips or suggestions, I’d love to hear them!
Thanks again!
Description: I intend to work on implementing the MFCC for feature extraction from audio signals. The MFCC is a popular technique for extracting features from audio signals such as voice or music, and is widely used in applications like voice recognition and music classification.
Implementation Details:
Libraries:
Functions:
Estimated Timeline: