This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
Sorry if this sounds like a dumb question. I am not an expert in eighter python or speaker diarization. After I have trained the model, how can I use it to determine how is speaking from a wave file. I am trying to determine how is speaking from a one audio telephone conversation.
Could I for example use test_test_sequence=wavfile.read(mywav) as a input to
predicted_cluster_id = model.predict(test_sequence, args), and get get a prediction of how spoke from this file?
My question is more about the use of the code. I hope you can help!
Sorry if this sounds like a dumb question. I am not an expert in eighter python or speaker diarization. After I have trained the model, how can I use it to determine how is speaking from a wave file. I am trying to determine how is speaking from a one audio telephone conversation.
Could I for example use test_test_sequence=wavfile.read(mywav) as a input to predicted_cluster_id = model.predict(test_sequence, args), and get get a prediction of how spoke from this file?
My question is more about the use of the code. I hope you can help!