drethage / speech-denoising-wavenet

A neural network for end-to-end speech denoising
MIT License
673 stars 165 forks source link

Bad denoising results on the provided model #15

Open kfmn opened 6 years ago

kfmn commented 6 years ago

While the provided model in sessions/001 works well on NSDTSEA test files, the results on my own noisy files (recorded in real conditions) are much worse.

What did I do:

  1. Cloned the entire project
  2. Put my files into the 'test' subdir of the project dir
  3. Run the command THEANO_FLAGS=optimizer=fast_compile,device=gpu python main.py --mode inference --config sessions/001/config.json --noisy_input_path test
  4. Took results from sessions/001/samples/samples_1

Large part of speech was suppressed, although SNR is not very low. Maybe I do something wrong?

To try it yourself: https://drive.google.com/open?id=1njlPLNjbTuY1QlW_19y06a1ywuImBUHo

danhuixie commented 6 years ago

Same bad results with the pretrained model, even for the NSDTSEA test files. :(.

yunzqq commented 4 years ago

Hi, kfmn,

I want to know the parameter size of wanenet? But i use the python 3, there are some problems on them. I just want to know the size of this model. Can you tell me the results of model.summary()? Thank you very much!