Closed janjanusek closed 4 weeks ago
It's been a while since this frequency was requested đ
For others that may land here, I'm happy to add this feature! If somebody creates a pull request to add it, I will be quick to review it, merge it in, and publish a package so it's available on NuGet in good time.
However, I'm unlikely to take the time to research what the request means thoroughly enough to implement it myself.
ChatGPT prompt: I maintain a C# library called 'spectrogram' which generates frequency spectrogram images from signals (often audio, but not always). A user requested that I implement "mel-frequency cepstral coefficients with it's representation in bitmap", but I do not know what that means. Can you explain what this feature is and what it could be used for?
ChatGPT response:
Mel-frequency cepstral coefficients (MFCCs) are features used frequently in audio and speech processing. They capture characteristics of audio in a way that mimics how humans perceive sound. MFCCs are particularly useful for tasks like speech and music recognition, speaker identification, and even environmental sound classification.
Hereâs a breakdown:
Mel-Frequency Cepstrum: The âmelâ scale is based on human hearing, which perceives frequency logarithmically. MFCCs are calculated by taking a signal, transforming it into a spectrum, applying a mel scale filter bank, and then computing the logarithm of the energies in each band. Finally, a discrete cosine transform (DCT) of these log energies produces the MFCCs, which emphasize the more perceptually relevant parts of the audio signal.
MFCC Representation as Bitmap: Once MFCCs are calculated, theyâre often displayed as a âheatmapâ-style bitmap, where each row represents a different MFCC (or band), and the time progression is represented along the x-axis. This bitmap visually represents how the sound's spectral characteristics change over time, highlighting features that may be significant for audio analysis.
Use Cases: MFCCs are widely used in applications like:
To implement this feature in your 'spectrogram' library, youâd:
Hello, I've being playing with your library for a while and first of all I think it's great.
Is it possible for you to implement also mel-frequency cepstral coefficients with it's representation in bitmap?
Thanks