Open moshengmao opened 10 months ago
Hello! I have run into this problem too. I changed the resample rate to 16000 and my results are similar to yours. Have you solved it yet?
Use ffmpeg to downsample audio files.
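A minimal sketch of what that could look like when driven from Python, assuming `ffmpeg` is installed and on the `PATH`. The helper names `build_ffmpeg_cmd` and `downsample_dir` and the output directory in the usage note are hypothetical, not part of this repo; `-ar` is ffmpeg's option for the output audio sample rate and `-y` overwrites existing output files.

```python
import subprocess
from pathlib import Path


def build_ffmpeg_cmd(src, dst, target_sr=16000):
    """Build an ffmpeg command that rewrites src to dst at target_sr Hz."""
    # -y: overwrite dst if it exists; -ar: output audio sample rate.
    return ["ffmpeg", "-y", "-i", str(src), "-ar", str(target_sr), str(dst)]


def downsample_dir(src_dir, dst_dir, target_sr=16000):
    """Downsample every .wav in src_dir into dst_dir (hypothetical layout)."""
    dst_dir = Path(dst_dir)
    dst_dir.mkdir(parents=True, exist_ok=True)
    for src in sorted(Path(src_dir).glob("*.wav")):
        cmd = build_ffmpeg_cmd(src, dst_dir / src.name, target_sr)
        # check=True raises CalledProcessError if ffmpeg exits with an error.
        subprocess.run(cmd, check=True)
```

For example, `downsample_dir("VCTK-DEMAND/test/noisy", "VCTK-DEMAND/test_16k/noisy")`, where the second path is only an illustration. The clean reference files would presumably need the same treatment so that both sides of the evaluation share one sample rate.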
pesq: 2.2732677546519677 csig: 3.7115074899207685 cbak: 2.772968618267093 covl: 3.022599602088685 ssnr: 2.0236725814308003 stoi: 0.8991248361647437
These are my results...
When I tested with the VCTK-DEMAND dataset, I found that the wavs in VCTK-DEMAND/test/ have a sample rate of 48000, but evaluation.py
so I added lines to resample:
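```python
import librosa
import torch
import torchaudio

# audio_path and device come from the surrounding evaluation script
noisy, sr = torchaudio.load(audio_path)
# e.g. audio_path = VCTK-DEMAND/test/noisy/p232_001.wav, sr = 48000
noisy_np = noisy.numpy()
# recent librosa versions expect the sample rates as keyword arguments
noisy_resampled_np = librosa.resample(noisy_np, orig_sr=sr, target_sr=16000)
noisy = torch.tensor(noisy_resampled_np)
sr = 16000
noisy = noisy.to(device)
```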
and generated some wavs. But the audio quality of the generated wav files is very poor, and the speech is hard to make out. How did you resample to 16000? Maybe my way of resampling is wrong?
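One thing that may be worth ruling out first is whether the generated files actually carry a 16000 Hz header and keep the original duration, since a mismatch between the samples and the declared rate can by itself make speech sound wrong. The sketch below uses only the standard-library `wave` module and assumes the files are plain PCM wavs; the helper names `wav_info` and `check_resampled` are hypothetical.

```python
import wave


def wav_info(path):
    """Return (sample_rate, duration_in_seconds) read from a PCM wav header."""
    with wave.open(path, "rb") as wf:
        sr = wf.getframerate()
        return sr, wf.getnframes() / sr


def check_resampled(original_path, resampled_path, target_sr=16000, tol=0.01):
    """True if the resampled file is at target_sr and its duration is unchanged.

    tol is the allowed duration difference in seconds.
    """
    _, orig_dur = wav_info(original_path)
    new_sr, new_dur = wav_info(resampled_path)
    return new_sr == target_sr and abs(new_dur - orig_dur) <= tol
```

If `check_resampled` returns `False` for a 48000 Hz input and its generated output, the problem is more likely in how the file was written than in the resampling call itself.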
And the result is