Reproducing results with VB-DMD testset

sp-uhh / sgmse

Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation

MIT License

454 stars 69 forks source link

Hi,

Thank you very much for your work and also for the published resources.

As I don't have access to WSJ dataset, I'm trying to reproduce the paper's result on VB-DMD (taken from here: https://datashare.ed.ac.uk/handle/10283/2791).

I have a couple of questions:

Which checkpoint should be used for with enhancement.py for the VB-DMD noisy testset?
I've played a little with the code, and it seems that the .wav files from the VM-DMD noisy test are samples ad 48KHz, while the models were trained on 16KHz (https://github.com/sp-uhh/sgmse/issues/16#issuecomment-1333990502). Should the .wav files be downsampled to 16KHz before passing it to the model?
I also want to evaluate the dereverberation checkpoint. Can the preprocessing scripts be used also on the VB-DMD testset?

sp-uhh / sgmse