Open Ywandung-Lyou opened 2 years ago
TDNN stands for time delay neural network, an architecture commonly used for speaker verification. I only use the front-end part of a TDNN to build the VAD network. cmvn should be cepstral mean and variance normalization, applied to the input features.
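To make both terms concrete, here is a minimal NumPy sketch, not the code from this repository. It assumes per-utterance statistics for CMVN and a symmetric context of five frames with a ReLU for the TDNN layer; the names `cmvn`, `tdnn_layer`, and `context` are hypothetical and chosen only for illustration.

```python
import numpy as np


def cmvn(features, eps=1e-8):
    """Cepstral mean and variance normalization over one utterance.

    features: array of shape (num_frames, num_coeffs), e.g. MFCCs.
    Each coefficient is shifted to zero mean and scaled to unit variance,
    which reduces channel and loudness differences between recordings.
    """
    mean = features.mean(axis=0, keepdims=True)
    std = features.std(axis=0, keepdims=True)
    return (features - mean) / (std + eps)


def tdnn_layer(x, weight, bias, context=(-2, -1, 0, 1, 2)):
    """One TDNN layer: splice neighbouring frames, then apply an affine map.

    x:      (num_frames, dim) input features
    weight: (len(context) * dim, hidden) shared across all time steps
    bias:   (hidden,)
    The context offsets and the ReLU are assumptions for this sketch.
    """
    num_frames = x.shape[0]
    start = max(0, -min(context))
    stop = min(num_frames, num_frames - max(context))
    # Only frames whose full context lies inside the utterance are kept.
    spliced = np.stack(
        [np.concatenate([x[t + c] for c in context]) for t in range(start, stop)]
    )
    return np.maximum(spliced @ weight + bias, 0.0)


rng = np.random.default_rng(0)
feats = rng.normal(loc=5.0, scale=3.0, size=(50, 13))  # stand-in for MFCCs
normed = cmvn(feats)
W = rng.normal(size=(5 * 13, 8))
b = np.zeros(8)
hidden = tdnn_layer(normed, W, b)  # shape (46, 8): two frames lost at each edge
```

The point of the "time delay" part is that the same weights are applied at every time step to a small window of frames, so the layer behaves like a 1-D convolution over time.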
I cannot find an explanation of TDNN in that paper, and I do not understand what the cmvn function does or why it is needed. Could you please explain?