mcw519 / PureSound

Make the sound you hear pure and clean by deep learning.
7 stars 0 forks source link

about tse #1

Closed zuowanbushiwo closed 1 year ago

zuowanbushiwo commented 1 year ago

你好 请问 skim_causal_460_wNoise_IS_tsdr.ckptlibri2mix_max_2spk_clean_16k_1c.ckpt 这两个模型有什么不一样吗? 我用Netron 查看这两个模型,发现它们参数的结构都一样,只有loss 函数有一些不一样。skim_causal_460_wNoise_IS_tsdr.ckpt 效果是要好一些吗?使用了更多的数据训练? 谢谢!

mcw519 commented 1 year ago

Hi,

skim_causal_460_wNoise_IS_tsdr.ckpt trained by LibriSpeech 100 and 360 clean set and also adding noise. libri2mix_max_2spk_clean_16k_1c.ckpt trained only by LibriSpeech 100 clean set without adding noise.

And yes, skim_causal_460_wNoise_IS_tsdr.ckpt utilzed tSDR as loss function rather than Si-SDR to make sure the consistence of output signal gain.

thanks.

zuowanbushiwo commented 1 year ago

Hi Thank you very much for your reply, is skim_causal_460_wNoise_IS_tsdr.ckpt also only suitable for 2-speaker mix? Thanks!

mcw519 commented 1 year ago

Hi, In general yes, because it trained by 2-mix data. thanks.

zuowanbushiwo commented 1 year ago

Hi Thank you very much for your reply, One more question, consider the case of 2 speakers (one target speaker, one interfering speaker), also with some noise. Is it possible to separate the voice of the target speaker and the voice of the interfering speaker at the same time? This may be an ordinary speech separation problem, but I don't know if the separation will be more accurate when one speaker is known. Thanks!

mcw519 commented 1 year ago

Hi, TSE task will always extract the speech of target speaker only, so there are no interfering speaker or noise in the output signal. thanks.

zuowanbushiwo commented 1 year ago

Hi Which of the following causal models works best? Thanks

td_tse_conv_tasnet_v0_causal
tse_unet_tcn_v0_causal
tse_skim_v0_causal
tse_skim_v1_causal
tse_skim_v2_causal
mcw519 commented 1 year ago

Hi, There are some records in the readme page. you can check it there. thanks.

zuowanbushiwo commented 1 year ago

ok thanks. I read this readme, but there are only td_tse_conv_tasnet_v0_causal and tse_skim_v0_causal results.

mcw519 commented 1 year ago

hi, I think the tse_skim_v0_causal is the one you need. thanks.

zuowanbushiwo commented 1 year ago

好的,祝您生活愉快,工作顺利!