Convolve vs. BlockConvolve for RIR Augmentation

jzlianglu / pykaldi2

Yet another speech toolkit based on Kaldi and PyTorch

MIT License

173 stars 33 forks source link

Hi, have been using your Simulator functionality and found it quite useful. However, the augmented data I'm obtaining from it has a ton of reverb (more than I'm expecting). Still diagnosing the problem, but is there any reason why this repo is using the equivalent of

FFTbasedConvolveSignals https://github.com/kaldi-asr/kaldi/blob/master/src/feat/signal.cc#L50

as opposed to

FFTbasedBlockConvolveSignals https://github.com/kaldi-asr/kaldi/blob/master/src/feat/signal.cc#L77

Kaldi does reverb by using the second https://github.com/kaldi-asr/kaldi/blob/master/src/featbin/wav-reverberate.cc#L96. Thanks!

jzlianglu / pykaldi2

Convolve vs. BlockConvolve for RIR Augmentation #9