Closed: muntasir2000 closed this issue 5 years ago
@muntasir2000 Yes! And we are already doing that ourselves.
In the paper "Generalized End-to-End Loss for Speaker Verification", we described a technique called MultiReader. We had been using MultiReader to train a single more for multiple languages (currently 8 langauges).
In the paper "Fully Supervised Speaker Diarization", in Section 4.1, we also mentioned that our d-vector V2 and V3 models are trained with non-English data, using MultiReader.
Describe the question
Hi, this might be a naive question, but can I combine speaker-annotated speech corpora from different languages (English, Mandarin, etc.) and use them to train the speaker embedding component? Are speaker embeddings and UIS-RNN language independent?
My background
Have I read the README.md file? yes
Have I searched for similar questions from closed issues? yes
Have I tried to find the answers in the paper Fully Supervised Speaker Diarization? yes
Have I tried to find the answers in the reference Speaker Diarization with LSTM? yes
Have I tried to find the answers in the reference Generalized End-to-End Loss for Speaker Verification? yes