google / uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
https://arxiv.org/abs/1810.04719
Apache License 2.0
1.56k stars 319 forks

[Question] How can I get the text of each speaker after speaker diarization? #59

Closed rowhitswami closed 5 years ago

rowhitswami commented 5 years ago

Describe the question

I want to run sentiment analysis on each speaker's speech after speaker diarization, so I need the text spoken by each speaker. This project looks very promising for my task. It would be really great if you could help me out.

My background

I'm a 4th-year engineering student from India, currently working as a Software Engineering Intern at InterviewBit.

Have I read the README.md file?

Have I searched for similar questions from closed issues?

Have I tried to find the answers in the paper Fully Supervised Speaker Diarization?

Have I tried to find the answers in the reference Speaker Diarization with LSTM?

Have I tried to find the answers in the reference Generalized End-to-End Loss for Speaker Verification?

wq2012 commented 5 years ago

You need ASR. Unfortunately we don't work on that.

rowhitswami commented 5 years ago

Any reference would be a great help. Thank you @wq2012

wq2012 commented 5 years ago

Just search Google for ASR (automatic speech recognition).
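
For readers landing on this thread: once an ASR system provides word-level timestamps, combining them with diarization output is mostly interval matching. The following is a minimal sketch, not part of uis-rnn. It assumes the diarization labels have already been converted into time segments of the form `(start_sec, end_sec, speaker_id)` and that the ASR output is a list of `(word, start_sec, end_sec)` tuples. Both formats, and the helper names `overlap` and `text_per_speaker`, are hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def overlap(a_start, a_end, b_start, b_end):
    """Length in seconds of the overlap between two time intervals."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def text_per_speaker(segments, words):
    """Assign each timestamped word to the speaker segment it overlaps most.

    segments: list of (start_sec, end_sec, speaker_id), assumed diarization output.
    words: list of (word, start_sec, end_sec), assumed ASR output.
    """
    transcript = defaultdict(list)
    for word, w_start, w_end in words:
        best_speaker, best_overlap = None, 0.0
        for s_start, s_end, speaker in segments:
            o = overlap(w_start, w_end, s_start, s_end)
            if o > best_overlap:
                best_speaker, best_overlap = speaker, o
        # Words that overlap no segment are skipped.
        if best_speaker is not None:
            transcript[best_speaker].append(word)
    return {spk: " ".join(ws) for spk, ws in transcript.items()}


# Made-up example data, for illustration only.
segments = [(0.0, 2.0, 0), (2.0, 4.0, 1)]
words = [
    ("hello", 0.1, 0.5),
    ("there", 0.6, 1.0),
    ("hi", 2.2, 2.5),
    ("back", 2.6, 3.0),
]
result = text_per_speaker(segments, words)
print(result)
```

The per-speaker strings in `result` can then be passed to any sentiment analysis tool. Picking the segment with the largest overlap is one simple choice; words that straddle a speaker change may still be assigned to the wrong speaker.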