-
As a beginner to this repo, I'd like to try out the examples on my own data and run them on a GPU to make sure things are working.
I noticed most of the examples do not have `batch = batch.to(self.devic…
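For context, the usual PyTorch pattern is to move each batch onto the same device as the model before the forward pass. A minimal sketch (the model, shapes, and names here are illustrative, not taken from the repo):

```python
import torch
import torch.nn as nn

# Pick the GPU when one is available, otherwise fall back to CPU.
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

model = nn.Linear(10, 2).to(device)   # model parameters live on `device`
batch = torch.randn(4, 10)            # a batch as a DataLoader would yield it (on CPU)

batch = batch.to(device)              # move the batch to the model's device
logits = model(batch)
print(logits.shape)                   # torch.Size([4, 2])
```

If this `.to(device)` call is missing and the model sits on a GPU, the forward pass fails with a device-mismatch error, which may be what the examples are silently avoiding by running on CPU.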
-
Hi, I am still new to Kaldi. I would like to perform diarization on some of the speech samples from my own dataset which do not have any speaker labels available, so I would have to listen and compare…
-
I have been using https://github.com/NVIDIA/NeMo/tree/main/tutorials/speaker_recognition.
There is a way we can get embeddings for speaker recognition. (https://github.com/NVIDIA/NeMo/blob/main/exa…
-
Our current **speaker encoder** is trained only on the LibriTTS (100, 360) datasets. However, we could improve its performance using other available datasets (VoxCeleb, LibriTTS-500, Common Voice, etc.). I…
-
Are pre-trained models available? Where can I find them?
Thanks!
Prashant
-
- Changing `--n_mels` from 40 to 64 leads to a small increase in performance.
- Using `--log_input` also leads to a small increase in performance.
- Combining two loss functions (e.g. `angleproto` a…
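Combining two loss objectives is typically done as a weighted sum whose gradients flow through both terms. A minimal PyTorch sketch (the stand-in terms and unit weights are assumptions for illustration, not the trainer's actual loss implementations):

```python
import torch
import torch.nn.functional as F

# Illustrative tensors: speaker embeddings and classifier logits for a batch.
embeddings = torch.randn(8, 192, requires_grad=True)
logits = torch.randn(8, 5, requires_grad=True)
labels = torch.randint(0, 5, (8,))

# Stand-in for a metric-learning term (e.g. an angular prototypical loss)
# and a softmax/cross-entropy classification term.
metric_term = embeddings.pow(2).mean()
ce_term = F.cross_entropy(logits, labels)

# Weighted sum: gradients from both objectives reach their parameters.
total_loss = 1.0 * metric_term + 1.0 * ce_term
total_loss.backward()
```

Tuning the relative weights (here both 1.0) is usually what makes or breaks such a combination.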
-
Hi, is it possible to extract the time (or location) at which each speaker's speech starts and ends?
I want to extract each speaker's speech, so I need to know when the speech is matched to the speakers and e…
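Diarization pipelines commonly emit segment boundaries in RTTM format, where each `SPEAKER` line carries the onset and duration of one speaker turn, so per-speaker start/end times can be read straight from that output. A minimal parser (field positions follow the standard RTTM layout):

```python
def parse_rttm(lines):
    """Return {speaker: [(start_sec, end_sec), ...]} from RTTM SPEAKER lines."""
    segments = {}
    for line in lines:
        fields = line.split()
        if not fields or fields[0] != "SPEAKER":
            continue
        # Fields: SPEAKER <file> <chan> <onset> <duration> <NA> <NA> <speaker> ...
        onset, dur, speaker = float(fields[3]), float(fields[4]), fields[7]
        segments.setdefault(speaker, []).append((onset, onset + dur))
    return segments

rttm = [
    "SPEAKER rec1 1 0.00 1.50 <NA> <NA> spk0 <NA> <NA>",
    "SPEAKER rec1 1 1.50 2.25 <NA> <NA> spk1 <NA> <NA>",
]
print(parse_rttm(rttm))
# {'spk0': [(0.0, 1.5)], 'spk1': [(1.5, 3.75)]}
```

With the start/end times in hand, the matching audio can be sliced out of the waveform for listening or further processing.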
-
I'd like to thank all the contributors for their efforts. We know that DDP should be faster than DP, but how many times faster is DDP than DP in SpeechBrain? I mean, if I use 8 RTX 2080 Ti GPUs in a…
-
We should start creating example recipes for some data sets and tasks. I'll post an initial list here, and we can modify or extend it based on discussions. I'll sort it by the level of implementation …
-
In the tutorial, the AMI dataset is used to train speech activity and change detection. However, the VoxCeleb dataset is used to train the speaker embedding. Does the speaker embedding model necessarily …