Hi!
I'm having trouble understanding the ECAPA-TDNN architecture.
Specifically, I don't understand what the individual components of ECAPA-TDNN do (PreEmphasis, MelSpectrogram, FBankAug, Conv1d, ReLU, BatchNorm1d, bottleneck, attention...) in the context of speaker verification.
What roles do the AAMsoftmax classifier, the Adam optimizer, and the StepLR scheduler play?
Thanks for your attention and time!