-
### Describe the feature
If I provide an audio file with multiple channels - e.g. an m4a that was recorded with multiple microphones - vibe currently transcribes only the first channel :(
good = …
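
For illustration only, here is a minimal sketch of the per-channel idea: once the file has been decoded to interleaved PCM samples (by whatever decoder is in use), each microphone's samples can be separated and passed to the transcriber one channel at a time. `split_channels` is a hypothetical helper, not part of vibe, and the toy sample values are made up.

```python
from typing import List, Sequence


def split_channels(interleaved: Sequence[int], n_channels: int) -> List[List[int]]:
    """De-interleave PCM samples: [a0, b0, a1, b1, ...] -> [[a0, a1, ...], [b0, b1, ...]]."""
    if n_channels < 1:
        raise ValueError("n_channels must be >= 1")
    if len(interleaved) % n_channels != 0:
        raise ValueError("sample count is not a multiple of n_channels")
    # Channel c owns every n_channels-th sample, starting at offset c.
    return [list(interleaved[c::n_channels]) for c in range(n_channels)]


if __name__ == "__main__":
    # Toy two-microphone recording (hypothetical values).
    for index, channel in enumerate(split_channels([1, 10, 2, 20, 3, 30], 2)):
        print(f"channel {index}: {channel}")
```

Each returned list could then be transcribed on its own and the results merged by timestamp.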
-
Is there a way to return word-level timestamps for a sentence?
example:
input sentence: "Hello readers,welcome!"
output:
[{
"word": "Hello",
"start_time": 0.02,
"end_time": 0.36,
},
{
"word": …
-
I use this line of code to transcribe and diarize at the same time:
```python
!pipx run insanely-fast-whisper --file-name "/content/drive/MyDrive/aurore.wav" --hf_token
```
but I get more s…
-
I have a WhisperX Python script for transcribing meetings, but the speaker diarization for German is really bad, unfortunately.
After some research I came across the fine-tuned German segmentation…
-
### Description
Train ECAPA-TDNN on adults' and children's voices
### Tasks
- [ ] explore the impact of training data window on speaker diarization performance (in a child-adult setting) and on spe…
-
Hi.
First of all, thank you for your project!
I have adapted your previous version (Q1 2024) and have been using it successfully.
There is one problem that I couldn't solve.
The main audio lan…
-
Hello! I would like to use WhisperX and Pyannote to combine automatic transcription and diarization. I can do it on Colab using the Huggingface (HF) token, but I would like to avoid entering the HF to…
-
When the first epoch ends, I get an error during evaluation because some speakers/labels are not present in label_encoder.txt
```
(.venv) root@8ad8297faf3e:/home/diarization/speechbr…
-
## Goal
- Remove the need to press the button, detect the voice
- Medium-term
- Enables ambient voice detection
- Enables interruptibility
- Small model that has binary classifier for vo…
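
As a rough sketch of the binary voice / no-voice decision described in the goals, the snippet below flags each fixed-size frame by comparing its RMS energy to a threshold. This energy threshold only stands in for the small model and is not the planned classifier; `detect_voice`, the frame size of 160 samples, and the threshold of 0.1 are assumptions chosen for illustration.

```python
import math
from typing import List, Sequence


def frame_rms(frame: Sequence[float]) -> float:
    """Root-mean-square energy of one frame of samples."""
    if not frame:
        return 0.0
    return math.sqrt(sum(s * s for s in frame) / len(frame))


def detect_voice(
    samples: Sequence[float],
    frame_size: int = 160,   # assumed: 10 ms at 16 kHz
    threshold: float = 0.1,  # assumed: tune per microphone
) -> List[bool]:
    """Return one True/False flag per full frame: True means 'voice-like energy'."""
    flags = []
    for start in range(0, len(samples) - frame_size + 1, frame_size):
        flags.append(frame_rms(samples[start:start + frame_size]) >= threshold)
    return flags


if __name__ == "__main__":
    silence = [0.0] * 160
    tone = [0.5 * math.sin(2 * math.pi * 200 * n / 16000) for n in range(160)]
    print(detect_voice(silence + tone + silence))
```

A run of consecutive `True` frames could replace the button press, and a `True` frame during playback could trigger an interruption.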
-
### What happened?
A bug happened!
### Steps to reproduce
1. step one...
2. step two...
### What OS are you seeing the problem on?
_No response_
### Relevant log output
```shell
options: {
…