Open Fangbo0506 opened 2 years ago
The the setup for the 50h is not shared here directly. There was some augmentation involved in creating these models which is not part of this codebase. It was basically just randomising the order of speech and noise files and randomising the SNR inside the training pipeline. Also 4s samples in a batch of 16 were used instead of 10s in a batch of 32.
My pesq can only reach 2.89 on the 50h dataset, which is inconsistent with the author's offer. But on 500h it can reach the author's 3.04.