shahules786 / mayavoz

Pytorch based speech enhancement toolkit.
MIT License
328 stars 21 forks source link

Quality not similar to example (would you like me to upload somewhere?) #51

Open boutell opened 1 year ago

boutell commented 1 year ago

I gave this a try with a one-minute sample of a speaker in a noisy room. It's very intelligible to start, but after running through the filter I just got pops and squeaks. I then tried normalizing it first, which produced an intelligible result, but very artificial and generally not as good as the original.

This is not a complaint! More a query to see if you are interested to have the before and after samples.

I'm using the technique in the README, along with the save_output flag. My input file is 44k mono WAV.