-
Hey Developers!
I was running the diarization (demo2) code to get line graphs for different speakers in a recording with the "similarity" variable on the Y axis.
However, could you help me unde…
-
Hi,
I read your paper "Speaker Diarization using Deep Recurrent Convolutional Neural Networks for speaker embedding". The details regarding the convolutional part were very clear.
But for the 2 rec…
-
Thank you for sharing your repository.
My question is the same as the one above.
Does this work with unseen speech?
-
Hello
I'm trying to build a new model on the VCTK dataset. First I generated speaker embeddings by running python generate_embeddings.py. Now I am training the model with train.py, but I have a proble…
-
Hello, I would like to use ecapa vectors for anonymization. Can the gan.pt file you provided be directly used for the anonymization training of ecapa vectors? Or do you need to train a new gan.pt file…
-
I have a speech transcript dataset in which ellipses (...) have been used to indicate speaker pauses. I am using BERT embeddings for text classification. It is very important for me that the BERT m…
-
It seems that the current implementation is designed for a single speaker's voice, like the LJSpeech dataset that you used. That dataset is a 24-hour audio recording of a single speaker.
I have a dataset of hundreds of …
-
Speaker diarization is where you annotate a transcript by noting which words were spoken by which speakers.
There are tools in Python that do this. It would be great to try them out and see if any …
-
Hello,
I am trying to interpolate between two speakers. I am using the model pretrained on LibriTTS.
I have read the issue "How is interpolation between speakers performed?" #33 but I still…
-
Hi,
I am trying to run a pipeline to extract embeddings.
The pipeline I am running is the one in the README:
```
import rx.operators as ops
import diart.operators as dops
from diart.source…