-
Hello! I would like to use WhisperX and pyannote to combine automatic transcription and diarization. I can do it on Colab using the Hugging Face (HF) token, but I would like to avoid entering the HF to…
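One common way to avoid retyping the token each session is to set it once in the environment. This is a sketch under the assumption that the downstream stack (`huggingface_hub`, which pyannote uses for gated models) reads the standard `HF_TOKEN` environment variable; the token value shown is a hypothetical placeholder:

```python
import os

# Assumption: huggingface_hub picks up HF_TOKEN from the environment,
# so setting it once in the notebook avoids pasting it interactively
# every time the pipeline is loaded.
os.environ["HF_TOKEN"] = "hf_xxx"  # hypothetical placeholder token
```

Alternatively, `huggingface-cli login` stores the token on disk so it never needs to be entered in the notebook at all.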
-
I use this line of code to transcribe and diarize at the same time:
```python
!pipx run insanely-fast-whisper --file-name "/content/drive/MyDrive/aurore.wav" --hf_token
```
but I get more s…
-
Hello developers! Thank you so much for developing Resemblyzer; it is an amazing tool for me.
I have actually encountered a problem while developing: when my input audio contains 3 spea…
-
Logic will be to combine Whisper + pyannote.audio based on timestamps to output something along the lines of:
```
Person A: Hi
Person B: Hello, how are you
Person A: I'm good, and you?
....
```
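A minimal sketch of that timestamp-merging logic, using plain dicts for the Whisper segments and diarization turns (the data shapes here are assumptions for illustration, not the libraries' actual APIs):

```python
# Sketch: assign each transcript segment the speaker whose diarization
# turn overlaps it the most, then print speaker-labeled lines.

def assign_speakers(transcript_segments, diarization_turns):
    labeled = []
    for seg in transcript_segments:
        best_speaker, best_overlap = "UNKNOWN", 0.0
        for turn in diarization_turns:
            # Overlap of [seg.start, seg.end] with [turn.start, turn.end]
            overlap = min(seg["end"], turn["end"]) - max(seg["start"], turn["start"])
            if overlap > best_overlap:
                best_overlap, best_speaker = overlap, turn["speaker"]
        labeled.append({"speaker": best_speaker, "text": seg["text"]})
    return labeled

# Hypothetical Whisper output (start/end in seconds) ...
segments = [
    {"start": 0.0, "end": 1.0, "text": "Hi"},
    {"start": 1.2, "end": 3.0, "text": "Hello, how are you"},
    {"start": 3.2, "end": 4.5, "text": "I'm good, and you?"},
]
# ... and hypothetical pyannote-style speaker turns.
turns = [
    {"start": 0.0, "end": 1.1, "speaker": "Person A"},
    {"start": 1.1, "end": 3.1, "speaker": "Person B"},
    {"start": 3.1, "end": 4.6, "speaker": "Person A"},
]

for line in assign_speakers(segments, turns):
    print(f'{line["speaker"]}: {line["text"]}')
# Person A: Hi
# Person B: Hello, how are you
# Person A: I'm good, and you?
```

A real pipeline would fill `segments` from Whisper's result and `turns` by iterating the pyannote diarization output, but the overlap rule stays the same.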
-
### Tested versions
Tested on 3.1 vs 3.0
### System information
Debian GNU/Linux, torch 2.1.2
### Issue description
When running diarization pipeline on CPU, v3.1 is more than 2x slower…
-
I keep getting this error whether I try diarization 3.0 or some other version, despite accepting the user agreements on HF - are there any fixes here:
```
torchaudio.set_audio_backend("soundfile")
Could…
```
-
```
hyperpyyaml
>>Performing transcription...
>>Performing alignment...
>>Performing diarization...
Lightning automatically upgraded your loaded checkpoint from v1.5.4 to v2.0.2. To apply the upgrad…
```
-
### Describe the feature
If I provide an audio file with multiple channels - e.g. an m4a recorded with multiple microphones - vibe currently only transcribes the first channel :(
good = …
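A rough sketch of two ways such a tool could handle the extra channels before transcription - downmix to mono, or transcribe each channel separately. NumPy arrays stand in for decoded audio here; none of these names are vibe's actual API:

```python
import numpy as np

def downmix_to_mono(audio: np.ndarray) -> np.ndarray:
    """Average all channels into one; audio has shape (channels, samples)."""
    return audio.mean(axis=0)

def split_channels(audio: np.ndarray) -> list[np.ndarray]:
    """Return each channel as its own mono signal, to transcribe separately."""
    return [audio[c] for c in range(audio.shape[0])]

# Tiny two-channel example (2 channels x 3 samples).
stereo = np.array([[0.0, 2.0, 4.0],
                   [2.0, 4.0, 6.0]])
print(downmix_to_mono(stereo))   # [1. 3. 5.]
print(len(split_channels(stereo)))  # 2
```

Splitting per channel is the more useful option when each microphone corresponds to one speaker, since the channel index then doubles as a speaker label.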
-
Hi, thanks for your code.
I am trying to use the model from "TOLD: A novel two-stage overlap-aware framework for Speaker Diarization", but cannot find the model (I found only the eend-ola code).
How can I expe…
-
Podcasts are usually conversations, so voice recognition is needed to identify the *author* and extract question-and-answer pairs from the transcript. Similar to video ingestion.
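A naive sketch of how question/answer pairs could be pulled from a speaker-labeled transcript once diarization has run. The heuristic (a line ending in "?" followed by a different speaker) is illustrative only:

```python
# Sketch: pair each question with the next utterance by a different speaker.
# Input is a list of (speaker, text) tuples, as produced by a
# diarization-plus-transcription step.

def extract_qa_pairs(lines):
    pairs = []
    for i, (speaker, text) in enumerate(lines[:-1]):
        next_speaker, next_text = lines[i + 1]
        # Heuristic: a question ends with "?" and is answered by someone else.
        if text.rstrip().endswith("?") and next_speaker != speaker:
            pairs.append((text, next_text))
    return pairs

transcript = [
    ("Host", "What got you into audio ML?"),
    ("Guest", "I started with music transcription."),
    ("Guest", "Then moved to speech."),
    ("Host", "Interesting."),
]
print(extract_qa_pairs(transcript))
# [('What got you into audio ML?', 'I started with music transcription.')]
```

A production version would need a real question classifier and handling for multi-turn answers, but the pairing step reduces to this kind of adjacency scan over speaker turns.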