extracted-speaker-embeddings Search Results

k2-fsa/sherpa-onnx #1460

Feature: Extracting speaker embeddings during diarization

My task combines both speaker diarization and speaker identification. Since speaker embeddings are extracted during diarization anyway, it would be fantastic if the user could extract speaker embed…

WilliamVenner updated 1 week ago

kaldi-asr/kaldi #4944

New to using Kaldi, just need a model to extract good voice …

Does anyone have an example python script that uses one on the x-vector extraction models developed here to extract embeddings? I've gone through some of the repo and have not found any such thing. …

PhilipAmadasun updated 1 month ago

cyrta/broadcast-news-videos-dataset #3

paper details

> More precisely, we use activations from the last layer of neural network as speaker embeddings. We aggregate the sigmoid outputs by summing all outputs class-wise over the whole audio excerpt to obt…

venkatesh-1729 updated 6 years ago

m-bain/whisperX #840

How to use a fine-tuned segmentation model for diarization?

I have a WhisperX Python script for transcribing meetings, but the speaker diarization for German is really bad, unfortunately. After some research I came across the fine-tuned German segmentation…

Arche151 updated 2 months ago

microsoft/SpeechT5 #50

SpeechT5: extracting Chinese speaker embedding

Hi, I have the same question as https://github.com/microsoft/SpeechT5/issues/16#issuecomment-1516257038. My training dataset is Chinese, so can i use speechbrain/spkrec-xvect-voxceleb to extract speak…

QQ-777777 updated 1 year ago

auspicious3000/contentvec #14

How to get a new spk2info.dict?

I want to train a new model with other dataset,but I don't find the way to get a new spk2info.dict.

gu76h updated 1 year ago

microsoft/UniSpeech #46

Speaker verification result

Hello, Thank you for your work on WavLM. I try to reproduce the results but I have some difficulties. First of all, I don't undestand exactly the difference between scores displayed in differen…

pierfale updated 1 month ago

lesterphillip/SVCC23_FastSVC #2

FastSVC implementation improvements

Adding here some implementation improvements that I need to do courtesy of comments from @r9y9 - [XX] Change F0 to log-F0 (and continuous) - [] Use original speaker embedding during training, - …

lesterphillip updated 11 months ago

SWivid/F5-TTS #377

Multiple reference audios?

Hi, is there a way to utilize multiple reference audios to capture more characteristics? I'm not to familiar how it works under the hood, but is some stacking or averaging possible to implement for…

kunibald413 updated 2 days ago

semperai/amica #41

How is possible to add new speechT5 models

How is possible change the text to speech model ? Is possible to use other .bin like voxpopuli for Italian language or other trained by ourself ? I try to add the voxpopuli.bin file in the public dire…

virtualrobotix updated 10 months ago

91 results for extracted-speaker-embeddings

91 results
for extracted-speaker-embeddings