-
Hi,
I use Voxceleb database and the kaldi recipe for speaker recognition.
now, I want to run this scrip on my own database.
I want to know, is there a kaldi recipe which received wave file (only on…
-
Hello! We have `speaker_linear_frame_libri` as speaker verification downstream task. Can we also have downstream task for that problem in more generalized way like in ge2e loss and training ([paper](h…
-
One option is the hdf5 data type.
Another is super fast data loading from one of our collaborators (I've forgotten which one).
-
How would I got about training and testing the model on the VoxCeleb dataset?
-
Thanks for incorporating multi-gpu support with _DistributedDataParallel_ . It's working as mentioned, but I have observed bug at the end of process. Below are the details:
For multi gpu trainin…
-
As a beginner to this repo, I'd like to try out the examples on my own data and run on a gpu to make sure things are working.
I noticed most of the examples do not have `batch = batch.to(self.devic…
-
Hi, I am still new to Kaldi. I would like to perform diarization on some of the speech samples from my own dataset which do not have any speaker labels available, so I would have to listen and compare…
-
Our current **speaker encoder** is trained with only LibriTTS (100, 360) datasets. However, we can improve its performance using other available datasets (VoxCeleb, LibriTTS-500, Common Voice etc.). I…
-
I have been using https://github.com/NVIDIA/NeMo/tree/main/tutorials/speaker_recognition.
There is a way we can get embeddings for speaker recognition. (https://github.com/NVIDIA/NeMo/blob/main/exa…
-
Are pre-trained models available? Where can I find them?
Thanks!
Prashant