rupakvignesh / Singing-Voice-Separation

A Modular Deep Neural Network framework for Singing Voice Separation.
6 stars 1 forks source link

The performance of DNN #1

Open ShengleiH opened 6 years ago

ShengleiH commented 6 years ago

Hello, thanks for your code sharing.

I am also working on singing voice separation tasks. Unfortunately, my result is not good, I get 5.38dB vocal GNSDR. I trained my model on mir-1k dataset with 175 song clips and I did data augmentation.

I am wondering what's your best result?

Thank you a lot :)

rupakvignesh commented 6 years ago

My results are around the same range and I'm currently working on revamping the code a little bit. During training, you can also try to reconstruct vocals while shuffling the background accompaniment.

ShengleiH commented 6 years ago

@rupakvignesh Thank you for your rapid reply. Do you mean data augmentation? Actually I have done data augmentation by shifting vocal and then combining it with the original background accompaniment. And the paper I referred to achieves 6.44dB vocal GNSDR, have you ever achieved this results?

rupakvignesh commented 6 years ago

In the current implementation, my loss function is different from the original paper's (I was keen to see if estimating just vocals (either directly or with time-freq masks) will be good -- but I was not able to get close to the results on paper.

Right now, I'm implementing the one suggested in paper and it will be up by the end of this week.

ShengleiH commented 6 years ago

@rupakvignesh Thank you ~ :)

rupakvignesh commented 6 years ago

Hello @ShengleiH, please check out the new commits and let me know if you have any questions.