-
For single file, it works fine.
Now I'm trying to process lots of files, for example 10000 audio files.
I made a script for loop processing, but it is loading diarization model in each calling.
I t…
ghost updated
3 years ago
-
Are input d-vectors for training assumed L2-normalized?
In [Generalized End-to-End Loss for Speaker Verification](https://arxiv.org/abs/1710.10467) they are defined as L2-normalized in eq. 4.
In…
-
Hello,
I am trying your code for dizrizaition by using another type of vector.
The problem is I need to change the input size.
In your training example, your vector length is 256.
If I have inpu…
-
## Describe the question
In my experiment, I find the speaker number predicted by uis-rnn model is more than real number.
I want to know how to control the different speaker number predicted by ui…
-
## Describe the question
I trained the uis-rnn model on embeddings obtained on timit data. I am calculating embedding over a 240 ms window with 50 % overlap. I am using this uis-rnn model to obtain s…
-
Hi bro.I'm interesting in your project and it is great.But the href does not exist.
Can you reupload the href,please?
-
## Describe the question
Hi, I have been working on this issue for almost a month.
I finally manage to get a good EER on the training of LSTM, and now training on UIS-RNN.
I have a question about…
-
## Describe the question
How do you take raw audio file annotated with speaker labels and convert them into a form that can be used by uis-rnn? There's no documentation for creating your own training…
-
-
> The flowchart of our diarization system is provided in Fig. 1. In this system, audio signals are first transformed into frames of width 25ms and step 10ms, and log-mel-filterbank energies of dimensi…