RookieJunChen / Inter-SubNet

The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.
Apache License 2.0
92 stars 12 forks source link

training dataset #6

Open PanYenChou opened 1 year ago

PanYenChou commented 1 year ago

I'm trying to retrain your InterSubnet. I have some questions about training datasets.

  1. I saw you used interspeech 2020 datasets in your code but the subset of the interspeech 2021 datasets in your paper. Which is correct ?
  2. If you use the subset of interspeech 2021 datasets, what kind of dataset did you use? Fullband or Wideband? Only use clean read_speech or do you use emotional speech and non-English speech? Thanks
RookieJunChen commented 1 year ago

Our training dataset mainly consists of the wideband (16kHz) data from the DNS Challenge at InterSpeech 2021. The clean dataset used is "readbook" and the noise dataset incorporates the complete set of noises. Our test dataset utilizes the referenced test set released in the DNS Challenge at InterSpeech 2020.

PanYenChou commented 1 year ago

Did you use drop band that reaches the performance in your paper?

RookieJunChen commented 1 year ago

The configs of my final reported results are listed in config. If there is a slight difference in your results, I think it's likely to be a randomness difference due to the dynamic mixing strategy. Besides, the type of GPU also has some effect on the results.