facebookresearch / denoiser

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model that works on the raw waveform and runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it can remove various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly to the raw waveform, which further improve model performance and generalization.

Enhancement Request: Browser plugin #136

Open peepo opened 1 year ago

peepo commented 1 year ago

Would it be possible to develop this work as a plugin for Firefox/Chrome?

I.e., so that the elderly might listen to streaming content with improved comprehension?

thanks!

Has this project been tested with elderly people who are losing the ability to understand speech because of hearing impairment?

adiyoss commented 1 year ago

Hi @peepo, thanks for the suggestion! At the moment we are not planning to add web browser support (someone from the community is more than welcome to contribute it :)). As for testing our method with elderly people, we did not run any evaluation like that; we mainly evaluated our method on the standard academic benchmarks. However, this is a great suggestion!