facebookresearch / denoiser

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.
Other
1.62k stars 299 forks source link

Pretrained model #134

Open caichuqiao opened 1 year ago

caichuqiao commented 1 year ago

Could you please provide the files of pretrained model, which can directly be used on the denoising? Thanks a lot!

adefossez commented 1 year ago

There is a colab in the readme on how to do this.

caichuqiao commented 1 year ago

There is a colab in the readme on how to do this.

Thanks for your answer! I tried to run the code on colab, but there's a RuntimeError, which says "Failed to load audio from ". Could you tell me how to figure it out?

Plutoisme commented 1 year ago

you can try ffmpeg.