facebookresearch / denoiser

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.
Other
1.65k stars 302 forks source link

fix: init `conv_state` and `lstm_state` when calling `flush` method #139

Closed hoyaaaa closed 1 year ago

hoyaaaa commented 1 year ago

fix: init conv_state when calling flush method

Resolves #137

adefossez commented 1 year ago

hey @hoyaaaa thanks for the PR, I think you would also need to reset the lstm_state to None too no ?

hoyaaaa commented 1 year ago

hey @hoyaaaa thanks for the PR, I think you would also need to reset the lstm_state to None too no ?

Thank you for your comment. Oh, I missed that point. , I will update it. Thank you.

adefossez commented 1 year ago

Awesome thanks for the fix.