Open giorgiolbt opened 3 years ago
No, that is not possible.
Use zero paddings. That's the common step in speech processing. Using keras, it only needs one line to make all utterances have the same row size.
Reference:
https://www.tensorflow.org/api_docs/python/tf/keras/preprocessing/sequence/pad_sequences
Hi,
I was wondering if it is possible to have a total number of windows that is fixed even when extracting LLD. At the moment, for each audio, I obtain a variable number of vectors of features that depends on the length of the audio since the window size is fixed. I would need to have for instance 200 rows for each audio independently from the audio's duration.
Thanks in advance! Giorgio