speaker-embeddings Search Results

k2-fsa/sherpa-onnx #1460

Feature: Extracting speaker embeddings during diarization

My task combines both speaker diarization and speaker identification. Since speaker embeddings are extracted during diarization anyway, it would be fantastic if the user could extract speaker embed…

WilliamVenner updated 1 week ago

ebowwa/caringmind #26

Python & SWIFT: Diarization and Embeddings

type IsSpeaking bool type WhoIsSpeaking uuid known speakers [chat on diarization embeddings](https://chatgpt.com/share/6704175b-9184-800f-bc01-2076a8af85bf) [chat on running models locall…

ebowwa updated 2 weeks ago

KoljaB/WhoSpeaks #4

Integration into RealtimeSTT?

Hi! I see that this repo hasn't been touched in a while...are there any plans to fold diarization into RealtimeSTT? Thanks!

BCordleRossVideo updated 3 weeks ago

DigitalPhonetics/IMS-Toucan #202

HF Gradio demo: sudden gender flip for slider

I've added Toucan to the TTS Arena fork by using the MassivelyMultilingualTTS space. Arena: https://huggingface.co/spaces/Pendrokar/TTS-Spaces-Arena TTS Space: https://huggingface.co/spaces/Flux9665…

Pendrokar updated 1 week ago

microsoft/RAG_Hack #116

VidSage: Video Insights using Graph RAG

### Project Name VidSage ### Description # VidSage: Video Insights using Graph RAG https://www.youtube.com/watch?v=IUSCWtB9jWk VidSage focuses on processing video data, storing it in Azur…

MayankKeshariC5 updated 1 week ago

wenet-e2e/wespeaker #364

Compression codec augmentation ?

Hi, Is there a way to randomly add a codec compression as a data augmentation when training speaker embeddings ? Is it already done in current pre-trained models ? Things like Opus, MP3 etc.. bu…

tcourat updated 1 month ago

pyannote/pyannote-audio #1685

Speaker Diarization pipeline.get_segmentations produces inte…

### Tested versions 3.1 ### System information macOs 13.6 - pyannote 3.1 - M2 air ### Issue description Im running ``` self.pipeline = Pipeline.from_pretrained( "pyannote/speaker-diarizatio…

bschreck updated 1 month ago

pyannote/pyannote-audio #1750

Possible to use reference speaker embeddings in Pyannote dia…

Hey everyone, I am trying to use Pyannote with Whisper for transcribing meetings between my business partner and me, but the result hasn't been that great, since about 50% of the times, the wrong s…

Arche151 updated 2 months ago

pyannote/pyannote-audio #1687

Embeddings takes 3x the length of the audio length

### Tested versions - pyannote-audio 3.1.1 ### System information windows 10 - pyannote.3.1.1 - rtx 3070 ### Issue description Diarization taking much longer than it should, using the progress ho…

cdreetz updated 1 week ago

myshell-ai/OpenVoice #310

How do I use the weights trained on mello TTS with the conve…

Hey, I've fine tuned mello tts for indian accent and a few indian languages. I wanted to use the weights in the tone converter but realized voice_conversion expects the averaged tensor values for sour…

kaushal-gawri9899 updated 1 month ago

1000+ results for speaker-embeddings

1000+ results
for speaker-embeddings