TaoRuijie / AVCleanse

ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'
30 stars 3 forks source link

about speaker-face step #1

Closed JINzezhong7 closed 11 months ago

JINzezhong7 commented 1 year ago

Thanks for you open source code. In paper, I guess you train speaker and face embedding network respectively. Then, use it to clean the data. But in speaker-face folder, you train speaker and face embedding network together. I want to ask why you do this. Thank you.

TaoRuijie commented 1 year ago

I just put them into the same code for training and evaluation, but two models are independent. The motivation is to check the multi-model speaker recognition results.