Closed NNPanNPU closed 1 year ago
Did you try the SI-SDR loss? How many microphones do you use in your experiments? Which RIR set do you use? And what is the speech overlap type in your dataset?
If you can provide your dataset (clean speech and RIRs), we can try to train NBC for you.
Wow, thank you! Here are my experimental settings.
(1) There are 8 microphones in the ULA, with an inter-element spacing of 4 cm. (2) T60 is randomly selected from 200 to 900 ms. (3) Room size: x is randomly selected from [2, 12] m, y from [2, 10] m, and z from [3, 4] m. (4) The training data is from WSJ0. Each clean speech signal is contaminated by an interferer (SIRs are the same as in WSJ0-2mix) and white noise (with an SNR of 20 to 40 dB). The interferer is at least 30 degrees away from the clean speech.
I tried the SI-SDR loss, which unfortunately stays around -10 dB. Do you have any experience with this situation? Is the problem related to the interferer's position or to the reverberation time?
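For readers who want to reproduce these settings, here is a minimal sketch of how one simulation setup could be drawn. It only samples the parameters; it does not generate RIRs or mix audio. The function name `sample_config`, the angle convention (measured from the array axis, 0 to 180 degrees), and the uniform distributions are assumptions, and the array position inside the room and the SIR sampling are left out because the post does not specify them.

```python
import random

def sample_config(rng, n_mics=8, spacing=0.04, min_sep_deg=30.0):
    """Draw one hypothetical simulation setup from the ranges listed above."""
    # Room size in metres: x in [2, 12], y in [2, 10], z in [3, 4]
    room = (rng.uniform(2, 12), rng.uniform(2, 10), rng.uniform(3, 4))
    # Reverberation time in seconds (200 to 900 ms)
    t60 = rng.uniform(0.2, 0.9)
    # 8-element ULA along the x axis, centred on the array's own origin
    half = (n_mics - 1) * spacing / 2
    mics = [(i * spacing - half, 0.0, 0.0) for i in range(n_mics)]
    # Assumption: directions are angles from the array axis in [0, 180] degrees
    target_deg = rng.uniform(0, 180)
    while True:
        interf_deg = rng.uniform(0, 180)
        if abs(interf_deg - target_deg) >= min_sep_deg:
            break  # interferer is at least min_sep_deg away from the target
    # White-noise SNR in dB (20 to 40 dB)
    snr_db = rng.uniform(20, 40)
    return {
        "room": room,
        "t60": t60,
        "mics": mics,
        "target_deg": target_deg,
        "interf_deg": interf_deg,
        "snr_db": snr_db,
    }

cfg = sample_config(random.Random(0))
print(cfg["room"], cfg["t60"], cfg["target_deg"], cfg["interf_deg"])
```

Passing a seeded `random.Random` keeps the draw reproducible, which helps when comparing array geometries on otherwise identical setups.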
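For reference, a minimal sketch of the usual SI-SDR definition in plain Python. This is not the implementation used in the repo. The names `si_sdr` and `si_sdr_loss`, the `eps` constant, and the convention that the loss is the negative SI-SDR are assumptions for illustration.

```python
import math

def si_sdr(estimate, reference, eps=1e-8):
    """Scale-invariant SDR in dB for two equal-length sequences of samples."""
    n = len(reference)
    # Remove the mean so a DC offset does not affect the score
    mean_e = sum(estimate) / n
    mean_r = sum(reference) / n
    e = [x - mean_e for x in estimate]
    r = [x - mean_r for x in reference]
    # Project the estimate onto the reference to obtain the scaled target
    alpha = sum(a * b for a, b in zip(e, r)) / (sum(b * b for b in r) + eps)
    target = [alpha * b for b in r]
    # Everything in the estimate that is not the scaled target counts as error
    error = [a - t for a, t in zip(e, target)]
    num = sum(t * t for t in target)
    den = sum(x * x for x in error)
    return 10 * math.log10((num + eps) / (den + eps))

def si_sdr_loss(estimate, reference):
    """Assumed sign convention: minimise the negative SI-SDR."""
    return -si_sdr(estimate, reference)

# Toy check: rescaling the estimate should leave the score (almost) unchanged
ref = [math.sin(0.1 * i) for i in range(1000)]
est = [r + 0.1 * math.cos(0.37 * i) for i, r in enumerate(ref)]
print(si_sdr(est, ref), si_sdr([2 * x for x in est], ref))
```

When comparing numbers across setups, it helps to state whether a reported value is the SI-SDR itself or its negative, since the two differ only in sign.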
Thanks again!
Oh, one more question: what are the speech overlap ways in your experiments, for both training and testing? The overlap ways need to be the same for training and testing, or the overlap ways used for training should at least roughly cover those used for testing.
Thanks for your answer and patience. I tried the code on a circular array and a half-spherical array. Both of them work well. I guess the ULA might be the reason, though that is very weird. BTW, what do you mean by "overlap ways"?
Great. We didn't know before that NBC doesn't perform well on ULA. Thank you for finding that. Our upcoming work might not have this problem. NBC might not perform well on ULA because a ULA doesn't provide as much spatial information as a circular or half-spherical array, which is fatal for a narrow-band method that relies entirely on spatial information for separation.
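One way to illustrate how a ULA can carry less spatial information is its mirror ambiguity: under a far-field, free-field, 2-D model, two directions mirrored about the array axis produce identical inter-microphone delays on a linear array, but not on a circular one. The sketch below is only an illustration under those assumptions, not an analysis from the NBC authors. The circular array radius of 5 cm, the speed of sound, and all function names are hypothetical choices.

```python
import math

C = 343.0  # assumed speed of sound in m/s

def ula(n_mics=8, spacing=0.04):
    """Microphone (x, y) positions of a uniform linear array on the x axis."""
    return [(i * spacing, 0.0) for i in range(n_mics)]

def circular(n_mics=8, radius=0.05):
    """Microphone (x, y) positions of a circular array (radius is hypothetical)."""
    return [(radius * math.cos(2 * math.pi * i / n_mics),
             radius * math.sin(2 * math.pi * i / n_mics))
            for i in range(n_mics)]

def far_field_delays(mics, azimuth_deg):
    """Arrival times in seconds relative to mic 0 for a plane wave from azimuth_deg."""
    a = math.radians(azimuth_deg)
    ux, uy = math.cos(a), math.sin(a)
    # A mic lying further towards the source receives the wavefront earlier
    taus = [-(x * ux + y * uy) / C for x, y in mics]
    return [t - taus[0] for t in taus]

def max_delay_difference(mics, az1_deg, az2_deg):
    """Largest per-mic difference between the delay patterns of two directions."""
    d1 = far_field_delays(mics, az1_deg)
    d2 = far_field_delays(mics, az2_deg)
    return max(abs(p - q) for p, q in zip(d1, d2))

# Directions +60 and -60 degrees are mirror images about the ULA axis
ula_gap = max_delay_difference(ula(), 60, -60)
circ_gap = max_delay_difference(circular(), 60, -60)
print("ULA delay gap (s):", ula_gap)
print("Circular delay gap (s):", circ_gap)
```

In a reverberant room the picture is more complicated, so this only hints at why two sources may be harder to tell apart with a ULA.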
Below are the four overlap ways of one speech pair that we considered in our code.
I'm closing this issue now. If you have more questions, you are welcome to reopen it. Thank you for your attention to our work. ^_^
@NNPanNPU Hello, we have revised NBC (NBC2) in this repo. It works well even with two microphones, so it should be OK with ULA.
Thanks! This is really nice work.
I tried the code on a uniform linear array (ULA) with SDR as the loss. However, the training loss stays around -7 dB to -8 dB, which is much higher than with the circular array. Are there any possible reasons? Do you have any suggestions for ULA?