-
Hi!
I'm using the following command:
`insanely-fast-whisper --file-name intro.mp3 --language en`
And I get an output.json which looks like this:
`{
"speakers": [],
"chunks": [{
…
-
I think it would be great to be able to leverage WhisperX and speaker diarization. Any plans to do this?
https://github.com/m-bain/whisperX
-
Can someone guide me on how to incorporate speaker diarization into a transcription model?
-
## Goal
Provide speaker labels along with the transcriptions (eg. `Speaker1: ...`, `Speaker2: ...`)
Do it in the same time when transcribing efficient and lightweight.
## Research
https://gi…
-
# Task Name
Speaker Diarization with ASR
[Description]: To do multi-speaker ASR where each speeches may have overlap.
## Task Objective
Most of the time, we do ASR on audio with only one main sp…
-
Dsnote is great for STT by using whisper. For audio samples with different persons speaking, e.g. podcasts, movies …, one ends up with a messy text because Whisper doesn’t do what’s called ‘speaker di…
-
I realize this isn't included in Whisper out of the box but would love to see this as an additional feature.
Is that something you've at all considered adding?
-
### Tested versions
3.3.0
### System information
win10
### Issue description
I am trying use below code to separate an audio, diarization labels is 3, but when s = 1, sources.data[:,s]…
-
I get a lot of "Speaker?" in the final file and i do not know how to improve this.
Maybe you can give a few tips how to work with the pipeline.
-
This request is a long and difficult shot, but it would definitely help if we had [speaker diarization](https://www.rev.com/blog/transcription-blog/what-is-speaker-diarization) tools that identifies w…