audeering / opensmile-python

Python package for openSMILE
https://audeering.github.io/opensmile-python/
Other
246 stars 33 forks source link

Extraction of fixed windows for LLD #27

Open giorgiolbt opened 3 years ago

giorgiolbt commented 3 years ago

Hi,

I was wondering if it is possible to have a total number of windows that is fixed even when extracting LLD. At the moment, for each audio, I obtain a variable number of vectors of features that depends on the length of the audio since the window size is fixed. I would need to have for instance 200 rows for each audio independently from the audio's duration.

Thanks in advance! Giorgio

frankenjoe commented 3 years ago

No, that is not possible.

bagustris commented 3 years ago

Use zero paddings. That's the common step in speech processing. Using keras, it only needs one line to make all utterances have the same row size.

Reference:
https://www.tensorflow.org/api_docs/python/tf/keras/preprocessing/sequence/pad_sequences