kkoutini / PaSST

Efficient Training of Audio Transformers with Patchout
Apache License 2.0
287 stars 48 forks source link

I have a problem. why convert wav to mp3? #29

Closed zdj97 closed 1 year ago

zdj97 commented 1 year ago

I have a problem. why convert .wav to .mp3 and 32k? And what would happen if converting to 16K and use .wav file?

zdj97 commented 1 year ago

Sor to ask this question. I am not sure if .wav file would cause problem to creat h5py.

kkoutini commented 1 year ago

Hi, I used mp3 to reduce the size of the dataset and overcome the slowdown from I/O. You can use wav as well, in that case, there is no need to decode the file here

Keep in mind if you change the samping rate, you need to adjust the STFT settings here to match.

zdj97 commented 1 year ago

Thanks! That is very useful for me.