yxlu-0102 / MP-SENet

MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra
MIT License
267 stars 40 forks source link

Training details #3

Closed hbwu-ntu closed 3 months ago

hbwu-ntu commented 11 months ago

Hi, your paper and code are excellent! I have learned a lot about speech enhancement from the paper, and I find your code to be very well-structured and clear. Thank you so much!

I have some questions:

  1. Have you fulfilled the multi-gpu running. It seems the training will stuck in https://github.com/yxlu-0102/MP-SENet/blob/main/train.py#L146-L160. If not, maybe I can help you with this issue
  2. What is the batch size, gpu type and training time for your experiments?

Thanks in advance.

yxlu-0102 commented 11 months ago

Thank you for recognizing our work!

During the training process, we actually employed a multi-GPU strategy. Personally, I used two NVIDIA 3090ti GPUs to train the MP-SENet like CUDA_VISIBLE_DEVICES=0,1 python train.py --config config.json

The batch size was set to 4, resulting in a batch size of 2 for each GPU. The training process is expected to take approximately 3 to 4 days.