Closed Tongyuang closed 2 years ago
The audio features are consistant with those provided by the CMU team. As mentioned in their paper:
We use the COVAREP software to extract acoustic features including 12 Mel-frequency cepstral coefficients, pitch, voiced/unvoiced segmenting features, glottal source parameters, peak slope parameters and maxima dispersion quotients.
Thank you. It helps
Hi, I just want to know how the feature of audio from dataset MOSEI is calculated.
I loaded one of the datasets,
the result goes:
so it means each audio piece has a feature of shape(50,74), but how to calculate these features from raw audio files?(like .mp3 or .wav files?)