TaoRuijie / ECAPA-TDNN

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 on Vox1_O when trained only on Vox2)
MIT License
594 stars, 113 forks

ECAPA-TDNN #50

Closed: rosana-sc7 closed this issue 1 year ago

rosana-sc7 commented 1 year ago

Hi!

I'm having trouble understanding the ECAPA-TDNN architecture.

[image]

To be specific, I don't understand what the elements of ECAPA-TDNN (PreEmphasis, MelSpectrogram, FBankAug, conv1d, relu, batchNorm1d, bottleneck, Attention, ...) do in the context of speaker verification.

What about the AAMsoftmax classifier, the Adam optimizer, and the StepLR scheduler?

Thanks for your attention and time!

TaoRuijie commented 1 year ago

Er, sorry, it looks too hard to answer your question... Sorry.