speaker-embeddings Search Results

1000+ results for speaker-embeddings

m-bain/whisperX #594

Diarization high memory usage not using dedicated gpu

Diarization runs very slowly, uses almost 12 GB of memory, and is seemingly not happening on the GPU (GPU-Z and Windows Task Manager show conflicting info) - Latest WhisperX repo - pyannote.audio 3…

Khaztaroth updated 11 months ago
2
SWivid/F5-TTS #377

Multiple reference audios?

Hi, is there a way to utilize multiple reference audios to capture more characteristics? I'm not too familiar with how it works under the hood, but is some stacking or averaging possible to implement for…

kunibald413 updated 2 days ago
4
joonson/syncnet_trainer #9

Evaluation on list save

Hi, I am wondering what the reasoning behind the evaluation implemented in evaluateFromListSave is - it seems to me this is loading in 2 audio files, running the audio feature extractor on them, and c…

annadodson787 updated 4 years ago
1
taylorlu/Speaker-Diarization #55

Speaker-Diarization for 2 person conversation

@taylorlu, I appreciate your effort on this repo! I have a small question, though: while trying the speaker diarization on a .wav file with 2 speakers, I am getting output for 4 different spe…

ArvindSharma18 updated 3 years ago
3
stephbuon/democracy-lab #156

Write Blurb for "Collocates" (github repo)

(see syllabus for instructions).

stephbuon updated 2 years ago
1
yacineMTB/talk #4

Research - Dynamic speech reflex

Right now, I'm planning to initiate the response with a "vim pedal", aka a hotkey, because knowing when to respond is difficult. https://github.com/yacineMTB/talk/blob/master/index.ts#L108-L135 Whe…

yacineMTB updated 1 year ago
8
espnet/espnet #5713

X-vector based TTS model packaging broken in tts.sh

**Describe the bug** PR #5579 broke xvector-conditioned TTS model packaging. In stage 9 of `tts.sh`, `spk_xvector.ark` was replaced with `{spk_embed_tag}.ark`, which in my recipe resolves to `xvector…

G-Thor updated 7 months ago
1
jaywalnut310/glow-tts #28

Sharing my results. Glow-tts is incredibly impressive!

Thank you so much for developing such a high-quality, sparse, and performant network, @jaywalnut310. I thought I'd share the results I have obtained so that others can see how promising your network i…

echelon updated 2 years ago
5
speechbrain/speechbrain #1825

[Bug]: High memory consumption to create audio embeddings

### Describe the bug Memory consumption is high while generating embeddings in the encode_batch() function of the EncoderClassifier. How can the memory consumption be reduced? ### Expected behav…

KalakondaKrish updated 3 months ago
6
CorentinJ/Real-Time-Voice-Cloning #1051

Poor attention with a different speaker encoder

First, thanks for the excellent work by CorentinJ! I noticed that the speaker encoder used in this work is GE2E, whose performance falls far behind the SOTA. So I replaced the GE2E encoder with …

MLrookie updated 2 years ago
3
