This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
After read the README.md. I know that, output of current prediction is not relative to previous prediction. So, how we can use this model for online streaming audio?
My background
Have I read the README.md file?
yes
Have I searched for similar questions from closed issues?
yes
Have I tried to find the answers in the paper Fully Supervised Speaker Diarization?
yes
Have I tried to find the answers in the reference Speaker Diarization with LSTM?
yes
Have I tried to find the answers in the reference Generalized End-to-End Loss for Speaker Verification?
Describe the question
After read the README.md. I know that, output of current prediction is not relative to previous prediction. So, how we can use this model for online streaming audio?
My background
Have I read the
README.md
file?Have I searched for similar questions from closed issues?
Have I tried to find the answers in the paper Fully Supervised Speaker Diarization?
Have I tried to find the answers in the reference Speaker Diarization with LSTM?
Have I tried to find the answers in the reference Generalized End-to-End Loss for Speaker Verification?