Audio-WestlakeU / FullSubNet

PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
https://fullsubnet.readthedocs.io/en/latest/
MIT License
554 stars 158 forks source link

Date Preprocessing #39

Closed zhucq closed 2 years ago

zhucq commented 2 years ago

Hi,

I use the pretrain model and its always very good effect except some special case, fine tuning need in my case. Now I train the model like this: 1, download dns interspeech2020 branch data, split clean data to 6s length and 3s overlap, then every epoch model meet 500h noisy data; 2, I use 2080Ti * 8 and batch size set to 8. Other param just like train.toml; 3, the train loss look like this image Is there any other preprocess on the orign train data? And any advice for train in this little batch? Thank you

hungcaovu commented 1 month ago

@zhucq What was the issue ? I am facing the same problem. Could you give me some hints? Thanks,