-
I would like to use sherpa-onnx for speaker diarization. However the current vad modal (silero) doesn't works well and doesn't detect speech correctly.
I tried another onnx model in the project [peng…
-
> Speaker diarization is the process of partitioning an audio stream into homogeneous segments according to the speaker identity.
Try to use pyannote to accomplish this. Try to download the entiret…
-
I am the creator and maintainer of pyannote.metrics and I just found out about your work.
I'd like to suggest that you contribute this new metric directly into the main pyannote.metrics.
That woul…
-
I'm using pyannote-onnx in conjunction with whisper.cpp, and I'm encountering an issue where whisper.cpp expects audio clips to be no longer than 30 seconds. However, pyannote sometimes detects speech…
-
may I ask why CUDA is not available for speaker ID?
cargo run --example max_speakers --features cuda -- 6_speakers.wav
When I add features cuda it works by the way :-)
https://github.com/t…
-
WhisperX diarization is done with Pyannote .
I'm using whisper-X for transcription in closed environment, no internet access.
It works well with whisper transcription , since we can download the…
-
Good evening,
I have tried looking for a solution in previous discussions issues and threads, with no luck - it could also be that I'm ignorant and could not recognize the issue and/or the solution i…
-
H:\AI\YouDub-webui\venv\Scripts\python.exe H:\AI\YouDub-webui\app.py
torchvision is not available - cannot save figures
Lightning automatically upgraded your loaded checkpoint from v1.2.7 to v2.2.0…
-
https://github.com/axinc-ai/ailia-models/tree/master/audio_processing/pyannote-audio
-
I am running tests with about 20 different audio files with different languages. I try the same audio file with both "diarize_whisper.rs" and "pyannote.rs". First of all I can say that segmentation an…