-
@yfchenmodelscope
Hello! I'd like to run inference with CAM++ to extract speaker embeddings for the LibriTTS dataset. I noticed that the code converts the sampling rate to 16kHz during the MFCC featu…
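A minimal resampling sketch, assuming the LibriTTS audio (natively 24 kHz) has to be brought down to 16 kHz before CAM++ feature extraction; the file path is a placeholder:

```python
import torchaudio
import torchaudio.functional as F

TARGET_SR = 16000  # the 16 kHz rate the feature extraction converts to

# "sample.wav" is a placeholder; LibriTTS ships at 24 kHz
wav, sr = torchaudio.load("sample.wav")
if sr != TARGET_SR:
    wav = F.resample(wav, orig_freq=sr, new_freq=TARGET_SR)
# `wav` is now ready for 16 kHz feature extraction / embedding inference
```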
-
Hi @ylacombe! I have a multi-speaker dataset with which I trained the Hindi checkpoint. I want to generate a particular speaker's voice during inference. Is there any way to do that using the inf…
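A minimal sketch, assuming the checkpoint is a multi-speaker VITS model usable through transformers (the checkpoint name and speaker index are hypothetical); multi-speaker `VitsModel` checkpoints accept a `speaker_id` argument:

```python
import torch
from transformers import VitsModel, AutoTokenizer

# "your-org/vits-hindi" is a hypothetical checkpoint name
model = VitsModel.from_pretrained("your-org/vits-hindi")
tokenizer = AutoTokenizer.from_pretrained("your-org/vits-hindi")

inputs = tokenizer("नमस्ते दुनिया", return_tensors="pt")
with torch.no_grad():
    # speaker_id picks one of the speakers seen during training
    waveform = model(**inputs, speaker_id=3).waveform
```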
-
Hi, thanks for sharing the code.
I have a folder of wav files from different speakers, but I don't understand what to do next to get a trained model. What type of files should be in the "mels" and "em…
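One common layout, sketched under the assumption that the repo expects one .npy file per utterance in "mels" and "embeds" (folder names from the question; all spectrogram parameters are assumptions that must match the repo's config):

```python
from pathlib import Path

import librosa
import numpy as np
from resemblyzer import VoiceEncoder, preprocess_wav

wav_dir, mel_dir, emb_dir = Path("wavs"), Path("mels"), Path("embeds")
mel_dir.mkdir(exist_ok=True)
emb_dir.mkdir(exist_ok=True)

encoder = VoiceEncoder()
for wav_path in sorted(wav_dir.glob("*.wav")):
    y, sr = librosa.load(wav_path, sr=22050)
    # log-mel spectrogram; n_fft/hop_length/n_mels are assumed values
    mel = librosa.feature.melspectrogram(y=y, sr=sr, n_fft=1024,
                                         hop_length=256, n_mels=80)
    np.save(mel_dir / f"{wav_path.stem}.npy", np.log(mel + 1e-5))
    # 256-dim utterance-level speaker embedding from Resemblyzer
    embed = encoder.embed_utterance(preprocess_wav(wav_path))
    np.save(emb_dir / f"{wav_path.stem}.npy", embed)
```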
-
### System Info
Transformers.js Alpha 10, Brave
### Environment/Platform
- [X] Website/web-app
- [ ] Browser extension
- [ ] Server-side (e.g., Node.js, Deno, Bun)
- [ ] Desktop app (e.g., Electron…
-
I am wondering how you extract the speaker embedding with the pre-trained verification model.
The speaker embedding I get from https://github.com/resemble-ai/Resemblyzer will have a vector…
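For comparison, a minimal sketch with a different pre-trained verification model, assuming SpeechBrain's ECAPA-TDNN checkpoint from the Hugging Face hub (the wav path is a placeholder); its embeddings are 192-dimensional, while Resemblyzer's utterance embeddings are 256-dimensional:

```python
import torchaudio
from speechbrain.pretrained import EncoderClassifier

classifier = EncoderClassifier.from_hparams(source="speechbrain/spkrec-ecapa-voxceleb")
signal, sr = torchaudio.load("speaker.wav")  # placeholder path, mono 16 kHz
embedding = classifier.encode_batch(signal)  # shape: (1, 1, 192)
```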
-
I am getting the following error when using the "openai/whisper-medium" model with timestamp prediction:
`There was an error while processing timestamps, we haven't found a timestamp as last token. Was W…
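For reference, a minimal sketch of how timestamp prediction is typically requested through the transformers ASR pipeline (the audio path is a placeholder; chunking long inputs with `chunk_length_s` is an assumed workaround that sometimes sidesteps the missing-last-timestamp failure, not a guaranteed fix):

```python
from transformers import pipeline

asr = pipeline(
    "automatic-speech-recognition",
    model="openai/whisper-medium",
    chunk_length_s=30,  # chunk long audio instead of decoding it in one pass
)
result = asr("audio.wav", return_timestamps=True)  # placeholder path
print(result["chunks"])  # list of {"timestamp": (start, end), "text": ...}
```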
-
## Title
A friendly introduction to word embeddings
## Abstract
We will discuss the limitations of traditional textual data representation methods and explore how we can do better.
In the proces…
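As a taste of the kind of example such a talk might use, here is a minimal sketch, assuming gensim's Word2Vec (the toy corpus is made up): dense vectors support graded similarity queries that one-hot or bag-of-words representations cannot.

```python
from gensim.models import Word2Vec

# Toy corpus; real training needs far more text
sentences = [["king", "queen", "royal"], ["man", "woman", "person"],
             ["king", "man"], ["queen", "woman"]]
model = Word2Vec(sentences, vector_size=50, window=2, min_count=1, seed=0)

# Dense embeddings give a graded notion of similarity,
# unlike sparse one-hot vectors, which are all equidistant
print(model.wv.similarity("king", "queen"))
```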
-
Hey @KoljaB, I have tried this tool and it is surprisingly good; it definitely outperformed pyannote.
But I'm wondering how it can be pushed to 10+ speakers or so. It would be really us…
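For comparison, pyannote itself can be given a speaker-count hint, which is the usual lever for crowded recordings; a minimal sketch (model name, token, and audio path are placeholders):

```python
from pyannote.audio import Pipeline

pipeline = Pipeline.from_pretrained(
    "pyannote/speaker-diarization-3.1", use_auth_token="HF_TOKEN"  # placeholders
)
# num_speakers pins the count; min_speakers/max_speakers bound it instead
diarization = pipeline("meeting.wav", num_speakers=10)
for turn, _, speaker in diarization.itertracks(yield_label=True):
    print(f"{turn.start:.1f}s-{turn.end:.1f}s: {speaker}")
```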
-
With the same WaveNet model and the same utterance (p225_001.wav), I found that the quality of the waveform generated from the mel-spectrogram in the provided metadata.pkl is much better than the one gener…
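Quality gaps like this usually trace back to mel-extraction settings that differ from the ones used to build metadata.pkl; a minimal check, sketched with librosa (every parameter value below is an assumption to be matched against the training config):

```python
import librosa
import numpy as np

y, sr = librosa.load("p225_001.wav", sr=22050)  # sr must match the training config
mel = librosa.feature.melspectrogram(
    y=y, sr=sr, n_fft=1024, hop_length=256, n_mels=80, fmin=40, fmax=7600
)
mel = np.log(np.clip(mel, 1e-5, None))  # log compression/normalization also matters
print(mel.shape)  # compare frame count and value range against the metadata mel
```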
-
Hi, first of all, great work!
The diarization works great for me on audio files with fewer than 3 speakers, but an audio file with close to or more than 8 speakers results in a very good transcri…