-
Hi!
I'm using the following command:
`insanely-fast-whisper --file-name intro.mp3 --language en`
And I get an output.json which looks like this:
`{
"speakers": [],
"chunks": [{
…
-
Diarizing a 20 minutes of audio is taking roughly 45 seconds time. Could you please let me know if the optimizations of insanely-fast-whisper focused just on whisper or we are planning to add some opt…
-
I think it would be great to be able to leverage WhisperX and speaker diarization. Any plans to do this?
https://github.com/m-bain/whisperX
-
If we know exact speakers in wav file (i.e. 2), is it some way to provide it to diarization pipeline?
Because library often determine an additional speakers - speaker02, speaker03 and so on.
Somethi…
-
### Tested versions
Tested on 3.1 vs 3.0
### System information
Debian GNU/Linux, torch 2.1.2
### Issue description
When running diarization pipeline on CPU, v3.1 is more than 2x slower…
-
Whisper model has limitation of 30s.
Can you integrate batch inference into sherpa?
I would like to use it along with the diarization.
I'm still not sure how exactly it possible to batch it but I…
-
**It would be great if users could update the speaker themself:**
1. as a user i want the ability to update the speaker's name of a transcript segment.
2. as a user i want an option to update speaker …
-
现在版本所使用的pyannote.audio-2.1.1说话人分离功能准确很率
希望博主有时间能尝试下pyannote-audio最新的3.3.1版本
还有就是reverb-diarization-v1和reverb-diarization-v2
这2个模型,看介绍都比pyannote-audio-3.0 要好20%以上
https://huggingface.co/Revai/rev…
dfrrk updated
1 month ago
-
Hi,
Is speaker diarization planned (espec. in realtime)?
Thx!
-
I get a lot of "Speaker?" in the final file and i do not know how to improve this.
Maybe you can give a few tips how to work with the pipeline.