Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.
Other
1.65k
stars
302
forks
source link
ValueError: numpy.ndarray size changed, may indicate binary incompatibility. Expected 88 from C header, got 80 from PyObject #112
I ran the python train.py in the quickstart example and I got this-
[2022-02-10 10:51:28,418][__main__][ERROR] - Some error happened
Traceback (most recent call last):
File "train.py", line 105, in main
_main(args)
File "train.py", line 99, in _main
run(args)
File "train.py", line 25, in run
from denoiser.solver import Solver
File "/home/ananyapam/Projects/Skit/denoiser/denoiser/solver.py", line 19, in <module>
from .evaluate import evaluate
File "/home/ananyapam/Projects/Skit/denoiser/denoiser/evaluate.py", line 14, in <module>
from pesq import pesq
File "/home/ananyapam/anaconda3/envs/denoiser/lib/python3.7/site-packages/pesq/__init__.py", line 6, in <module>
from .cypesq import cypesq, cypesq_retvals, cypesq_error_message as pesq_error_message
File "pesq/cypesq.pyx", line 1, in init cypesq
ValueError: numpy.ndarray size changed, may indicate binary incompatibility. Expected 88 from C header, got 80 from PyObject```
I ran the
python train.py
in the quickstart example and I got this-