-
[comment]: (Just paste "x" inside brackets, for example: - [x] Some positive statement)
**What problem are you facing?**
- [ ] audio isn`t recorded
- [ ] audio is recorded with artifacts
- [ …
-
### Introduction
Speaker diarization is the process of partitioning an input audio stream into homogeneous segments according to the speaker's identity. It can enhance the readability of an automatic…
-
Because of the error rate viz and above al speaker detection your whisper ui is better for research use than all the others I have tried. Please consider implementing Meta's MMS with speech recogniti…
-
### Useful links found in the book
1. [TransformersLibrary](https://github.com/huggingface/transformers)
2. [Rnn](http://karpathy.github.io/2015/05/21/rnn-effectiveness/) ![house_generate](https://…
-
Hello
A user on Stack Overflow (not me) has reported a problem with speech_recognition grabbing the audio from a Zoom call if you run a script whilst on Zoom:
https://stackoverflow.com/questions/6…
-
Hello,
I am interested in using pywhispercpp for speech recognition and speaker diarization.
I have installed the library and followed the instructions in the README file, but I am not sure how …
-
Hi, this work is really interesting. I would like to ask two informations...
Is it possible to realize a speaker diarization like this in real time? Hence, for example, while many people are speaki…
-
# Task Name
[Task name]: Target Speaker ASR
[Description]: Given a multispeaker speech utterance, decode the text corresponding to the specified speaker.
## Task Objective
Multispeaker ASR i…
-
The link to the sample files
` http://pannous.net/spoken_numbers.tar
`
in "Chapter 9 - Identifying speakers with voice recognition.ipynb" line 18 seems to be invalid. Where do i find those files?
…
-
https://github.com/alphacep/vosk-api/
This project has a libre, offline-capable speech recognition engine that's over 90% accurate - perfect for autoedit!
Vosk is a speech recognition toolkit. T…