[Question] Which feature was used for VAD?

google / uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

Apache License 2.0

1.56k stars 319 forks source link

Describe the question

Hi, thanks for open-sourcing this awesome project. Which feature was used for VAD? d-vector or PLP features (as you mentioned in "Speaker Diarization With LSTM") ?

My background

Have I read the README.md file? yes Have I searched for similar questions from closed issues? yes Have I tried to find the answers in the paper Fully Supervised Speaker Diarization? yes Have I tried to find the answers in the reference Speaker Diarization with LSTM? yes Have I tried to find the answers in the reference Generalized End-to-End Loss for Speaker Verification? yes

google / uis-rnn

[Question] Which feature was used for VAD? #48

Describe the question

My background