This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
A clear and concise description of what the question is.
My background
Have I read the README.md file?
yes/no - if you answered no, please stop filing the issue, and read it first
Have I searched for similar questions from closed issues?
yes/no - if you answered no, please do it first
Have I tried to find the answers in the paper Fully Supervised Speaker Diarization?
yes/no
Have I tried to find the answers in the reference Speaker Diarization with LSTM?
yes/no
Have I tried to find the answers in the reference Generalized End-to-End Loss for Speaker Verification?
yes/no
Hello, preparations have been completed. Now we want to train our voice data. I want to ask three questions: 1. How many speakers do we need at least? 2. How long does each speaker need to say a few words? 3. Does each speaker need to speak the same sentence? Thank you for your guidance.
Describe the question
A clear and concise description of what the question is.
My background
Have I read the
README.md
file?Have I searched for similar questions from closed issues?
Have I tried to find the answers in the paper Fully Supervised Speaker Diarization?
Have I tried to find the answers in the reference Speaker Diarization with LSTM?
Have I tried to find the answers in the reference Generalized End-to-End Loss for Speaker Verification?