Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.
Hello! I saw that your code and models are released under CC-BY-NC 4.0 license. And as I understand it, commercial use is prohibited. Could you tell us why you decided to use it?
Also I wanted to ask if there are any updates on this? Have you considered switching to an MIT license?
Especially in the context of the fact that facebook has released llama 2 with the possibility of commercial use
Hello! I saw that your code and models are released under CC-BY-NC 4.0 license. And as I understand it, commercial use is prohibited. Could you tell us why you decided to use it? Also I wanted to ask if there are any updates on this? Have you considered switching to an MIT license?
Especially in the context of the fact that facebook has released llama 2 with the possibility of commercial use