Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)

We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model that works on the raw waveform and runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it can remove various kinds of background noise, both stationary and non-stationary, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly to the raw waveform, which further improve the model's performance and generalization.
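To make "optimized in both the time and frequency domains" concrete, here is a minimal, dependency-free sketch of a combined objective: a mean absolute error on the waveform plus a mean absolute error on framed magnitude spectra. This is an illustration only, not the repository's loss. The function names, the frame size, the hop, the unwindowed naive DFT, and the `spectral_weight` parameter are all assumptions chosen for readability; a real training loop would use a differentiable STFT in PyTorch instead.

```python
import cmath
import math


def frame_magnitudes(signal, frame_size=8, hop=4):
    """Magnitude spectra of overlapping frames, via a naive DFT (no window)."""
    frames = []
    for start in range(0, len(signal) - frame_size + 1, hop):
        frame = signal[start:start + frame_size]
        spectrum = []
        # Keep only the non-redundant bins of a real-valued signal.
        for k in range(frame_size // 2 + 1):
            coeff = sum(
                x * cmath.exp(-2j * math.pi * k * n / frame_size)
                for n, x in enumerate(frame)
            )
            spectrum.append(abs(coeff))
        frames.append(spectrum)
    return frames


def time_loss(estimate, clean):
    """Mean absolute error between the two waveforms."""
    return sum(abs(e - c) for e, c in zip(estimate, clean)) / len(clean)


def spectral_loss(estimate, clean, frame_size=8, hop=4):
    """Mean absolute error between the framed magnitude spectra."""
    est = frame_magnitudes(estimate, frame_size, hop)
    ref = frame_magnitudes(clean, frame_size, hop)
    total = sum(abs(a - b) for fe, fr in zip(est, ref) for a, b in zip(fe, fr))
    count = sum(len(f) for f in ref)
    return total / count


def combined_loss(estimate, clean, spectral_weight=1.0):
    """Time-domain term plus a weighted frequency-domain term (hypothetical weighting)."""
    return time_loss(estimate, clean) + spectral_weight * spectral_loss(estimate, clean)


# Toy signals: a clean sinusoid and a copy with a small alternating offset.
clean = [math.sin(2 * math.pi * 2 * n / 16) for n in range(16)]
noisy = [x + (0.1 if n % 2 == 0 else -0.1) for n, x in enumerate(clean)]

print(combined_loss(clean, clean))      # identical signals give 0.0
print(combined_loss(noisy, clean) > 0)  # any mismatch gives a positive loss
```

The two terms penalize different things: the waveform term is sensitive to sample-level differences, including phase, while the magnitude term compares the energy distribution across frequency bins within each frame.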
Hi @peepo,
Thanks for the suggestion! At the moment we are not planning to add web browser support (though anyone from the community is more than welcome to contribute it :)).
Regarding testing our method with elderly listeners: we have not run any evaluation like that. We mainly evaluated our method on the standard academic benchmarks. However, this is a great suggestion!
Would it be possible to develop this work as a plugin for Firefox/Chrome,
i.e. so that elderly people could listen to streaming content with improved comprehension?
thanks!
Has this project been tested with elderly people who are losing the ability to understand speech because of hearing impairment?