-
My task combines both speaker diarization and speaker identification.
Since speaker embeddings are extracted during diarization anyway, it would be fantastic if the user could extract speaker embed…
-
type IsSpeaking
bool
type WhoIsSpeaking
uuid
known speakers
[chat on diarization embeddings](https://chatgpt.com/share/6704175b-9184-800f-bc01-2076a8af85bf)
[chat on running models locall…
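
The notes above can be read as a small data model: `IsSpeaking` as a boolean, `WhoIsSpeaking` as a UUID, plus a list of known speakers to match against. Below is a minimal sketch of that idea, assuming each known speaker is stored with one reference embedding and that identification is done by cosine similarity against a threshold. The `KnownSpeaker` class, the `identify` helper, and the `0.7` threshold are hypothetical names and values chosen for illustration; they do not come from any library.

```python
import math
import uuid
from dataclasses import dataclass
from typing import Optional, Sequence

# Type aliases mirroring the notes: a voice-activity flag and a speaker ID.
IsSpeaking = bool
WhoIsSpeaking = uuid.UUID


@dataclass
class KnownSpeaker:
    """Hypothetical record: one enrolled speaker and a reference embedding."""
    speaker_id: WhoIsSpeaking
    embedding: Sequence[float]


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two vectors; returns 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def identify(
    embedding: Sequence[float],
    known_speakers: Sequence[KnownSpeaker],
    threshold: float = 0.7,  # assumed value; tune on real data
) -> Optional[WhoIsSpeaking]:
    """Return the ID of the most similar known speaker, or None if no
    speaker reaches the similarity threshold."""
    best_id: Optional[WhoIsSpeaking] = None
    best_score = threshold
    for speaker in known_speakers:
        score = cosine_similarity(embedding, speaker.embedding)
        if score >= best_score:
            best_id, best_score = speaker.speaker_id, score
    return best_id
```

In use, the embedding passed to `identify` would be whatever the diarization step produces for a segment; an unknown voice returns `None` and could then be enrolled under a fresh `uuid.uuid4()`.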
-
Hi! I see that this repo hasn't been touched in a while. Are there any plans to fold diarization into RealtimeSTT? Thanks!
-
I've added Toucan to the TTS Arena fork by using the MassivelyMultilingualTTS space.
Arena: https://huggingface.co/spaces/Pendrokar/TTS-Spaces-Arena
TTS Space: https://huggingface.co/spaces/Flux9665…
-
### Project Name
VidSage
### Description
# VidSage: Video Insights using Graph RAG
https://www.youtube.com/watch?v=IUSCWtB9jWk
VidSage focuses on processing video data, storing it in Azur…
-
Hi,
Is there a way to randomly apply codec compression as a data augmentation when training speaker embeddings? Is this already done in the current pre-trained models?
Things like Opus, MP3, etc., bu…
-
### Tested versions
3.1
### System information
macOS 13.6 - pyannote 3.1 - M2 Air
### Issue description
I'm running ```
self.pipeline = Pipeline.from_pretrained(
"pyannote/speaker-diarizatio…
-
Hey everyone,
I am trying to use Pyannote with Whisper to transcribe meetings between my business partner and me, but the results haven't been great: about 50% of the time, the wrong s…
-
### Tested versions
- pyannote-audio 3.1.1
### System information
Windows 10 - pyannote 3.1.1 - RTX 3070
### Issue description
Diarization is taking much longer than it should, using the progress ho…
-
Hey, I've fine-tuned MeloTTS for an Indian accent and a few Indian languages. I wanted to use the weights in the tone converter but realized voice_conversion expects the averaged tensor values for sour…