-
Hello! I have a question about the EER results reported in your research paper. My question concerns only the English version of CAM++ (with the Chinese version, all the results look right).
First of all…
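For context on the metric being discussed: a minimal sketch of how EER is conventionally computed from genuine and impostor trial scores. The score values below are hypothetical, and this brute-force threshold sweep is for illustration only.

```python
def compute_eer(genuine, impostor):
    """Equal error rate: the operating point where the false-accept
    rate (impostors accepted) equals the false-reject rate (genuine
    trials rejected)."""
    best = None
    # Sweep every candidate threshold taken from the pooled scores.
    for t in sorted(set(genuine) | set(impostor)):
        far = sum(s >= t for s in impostor) / len(impostor)  # false accepts
        frr = sum(s < t for s in genuine) / len(genuine)     # false rejects
        gap = abs(far - frr)
        if best is None or gap < best[0]:
            best = (gap, (far + frr) / 2)
    return best[1]

# Hypothetical cosine scores: genuine trials should score higher.
eer = compute_eer([0.8, 0.7, 0.9, 0.6], [0.2, 0.3, 0.5, 0.65])  # -> 0.25
```

Real evaluations interpolate the DET curve rather than averaging at the closest threshold, but the idea is the same.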
-
Given previously recorded and recognized speaker embeddings used for diarization, it seems like it would be possible to match any new voice to a previously recorded database of known voices with assoc…
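The matching described above is usually done by scoring a new embedding against each enrolled embedding with cosine similarity and accepting the best match only above a threshold. A minimal sketch, with made-up speaker names, vectors, and threshold:

```python
import math

def cosine(a, b):
    """Cosine similarity between two embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)

def identify(embedding, database, threshold=0.6):
    """Return the best-matching known speaker, or None if the best
    score falls below the acceptance threshold."""
    name, score = max(((n, cosine(embedding, e)) for n, e in database.items()),
                      key=lambda p: p[1])
    return name if score >= threshold else None

# Hypothetical enrolled-speaker database (real embeddings are ~192-512 dims).
db = {"alice": [1.0, 0.0, 0.1], "bob": [0.0, 1.0, 0.2]}
best = identify([0.9, 0.1, 0.1], db)  # -> "alice"
```

The threshold controls the open-set behavior: too low and unknown voices get mislabeled as enrolled speakers, too high and genuine matches are rejected.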
-
I have a WhisperX Python script for transcribing meetings, but the speaker diarization for German is really bad, unfortunately.
After some research I came across the fine-tuned German segmentation…
-
Hello,
I just trained on two speakers at the same time.
The filelist looks like this:
```
/home/ubuntu/RVC-beta-v2-0528/logs/merged/0_gt_wavs/0_4_48.wav|/home/ubuntu/RVC-beta-v2-0528/logs/merg…
Rolun updated
4 months ago
-
torchrun --nnodes=1 --nproc_per_node=8 --master_port=25001 \
llava/train/train_mem.py \
--model_name_or_path /path/to/checkpoint_llava_med \
--data_path /path/to/your_dental_dataset.jso…
-
Is there any way to integrate [voicefixer](https://github.com/haoheliu/voicefixer_main) into a speaker diarization pipeline?
The package takes a wav file as input and gives an upsampled 44100 Hz wav file…
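One practical wrinkle (an assumption about the usual setup, not something stated in the question): most diarization models expect 16 kHz mono input, so voicefixer's 44.1 kHz output would need resampling before entering the pipeline. A toy illustration of that step on a made-up sample list; in practice a proper polyphase resampler (e.g. `scipy.signal.resample_poly`) should be used instead.

```python
def resample_linear(samples, sr_in, sr_out):
    """Naive linear-interpolation resampler (illustration only —
    no anti-aliasing filter, so not suitable for production audio)."""
    n_out = int(len(samples) * sr_out / sr_in)
    out = []
    for i in range(n_out):
        # Map each output index to a fractional position in the input.
        pos = i * (len(samples) - 1) / max(n_out - 1, 1)
        lo = int(pos)
        hi = min(lo + 1, len(samples) - 1)
        frac = pos - lo
        out.append(samples[lo] * (1 - frac) + samples[hi] * frac)
    return out

# Hypothetical restored 44.1 kHz signal, downsampled to the 16 kHz
# that typical diarization models expect.
restored = [0.0, 0.5, 1.0, 0.5, 0.0, -0.5, -1.0, -0.5, 0.0]
down = resample_linear(restored, 44100, 16000)
```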
-
Hi,
I am trying to extract the audio features from the clips.
I've downloaded the clips and then ran the code 'batch_audio_embedding.py' (inside the folder audio-visual/active-speaker-detect…
-
Hi!
I've encountered a problem.
I have a multi-speaker dataset.
If I train a separate model per speaker (a single-speaker model), the prosody, speed, intonation, timbre, and identity are good (for the spe…
-
Hi,
I want to use the **wavlm** model to extract speaker embeddings for a speaker verification task. In [the paper](https://arxiv.org/pdf/2110.13900.pdf) it is mentioned that for the task of speaker verificat…
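A common way to collapse frame-level features into one utterance-level speaker embedding is temporal mean pooling. In the WavLM verification setup the hidden states feed an x-vector-style head, so the sketch below (with made-up frame vectors) is only an illustration of the pooling idea, not the paper's exact recipe:

```python
def mean_pool(frames):
    """Average a list of equal-length frame-level feature vectors
    into a single utterance-level embedding."""
    dim = len(frames[0])
    return [sum(f[d] for f in frames) / len(frames) for d in range(dim)]

# Made-up 2-dim frame features standing in for WavLM hidden states
# (the real model outputs 768/1024-dim frames).
frames = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
emb = mean_pool(frames)  # -> [3.0, 4.0]
```

For the actual model, Hugging Face `transformers` ships a `WavLMForXVector` class intended for exactly this verification use case, which may be a simpler starting point than pooling raw hidden states by hand.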
-
Adding here some implementation improvements that I need to do courtesy of comments from @r9y9
- [x] Change F0 to log-F0 (and continuous)
- [ ] Use original speaker embedding during training,
- …
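The first checklist item (log-F0, made continuous) is typically implemented by interpolating the F0 track through unvoiced frames before taking the log. A minimal sketch, assuming unvoiced frames are marked with 0:

```python
import math

def continuous_log_f0(f0):
    """Convert an F0 track (0 = unvoiced) to continuous log-F0:
    linearly interpolate across unvoiced gaps, hold edge values,
    then take the natural log."""
    voiced = [i for i, v in enumerate(f0) if v > 0]
    if not voiced:
        return [0.0] * len(f0)
    out = list(f0)
    for i in range(len(out)):
        if out[i] > 0:
            continue
        prev = max((j for j in voiced if j < i), default=None)
        nxt = min((j for j in voiced if j > i), default=None)
        if prev is None:          # leading unvoiced run: hold first voiced value
            out[i] = f0[nxt]
        elif nxt is None:         # trailing unvoiced run: hold last voiced value
            out[i] = f0[prev]
        else:                     # interior gap: linear interpolation
            w = (i - prev) / (nxt - prev)
            out[i] = f0[prev] * (1 - w) + f0[nxt] * w
    return [math.log(v) for v in out]

clf = continuous_log_f0([0.0, 100.0, 0.0, 200.0, 0.0])
```

Interpolating in linear Hz before the log (as here) versus interpolating in the log domain is a design choice; a separate voiced/unvoiced flag is usually kept alongside so the model can still tell the regions apart.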