-
Hi, I am trying to use pySLGR to develop a audio to text transcription system. After going through the example programs and documentation, I was wondering does pySLGR provide methods only for feature …
-
# Context
I have been working on a personal project for several weeks that utilizes WhisperX for speech recognition. Recently, I decided to experiment with a custom Voice Activity Detection (VAD) sys…
-
In CTranslate2 Whisper model, batch generate is not faster than looping one by one. I tried the same thing on Translator model and it shows batching is far superior (a lot faster). I used Whisper smal…
-
I was wondering how the language-agnostic part works, since on my first few quick tests, it generated a totally wrong output for Dutch subtitles, but a perfect one for English subs.
The dutch output …
-
#
[sound-spaces](https://github.com/facebookresearch/sound-spaces)
[Project: RLR-Audio-Propagation](https://github.com/facebookresearch/rlr-audio-propagation)
[Audio Sensor](https://github.com/f…
yyf17 updated
2 years ago
-
I would like to start drafting a new package for speech signal processing, focused mainly on speech feature extraction (MFCCs, LPCs, fundamental frequency, etc). @davidavdav has a lot of work on MFCCs…
-
Hi,I am interested in this project, (I use jack as input but with an array with 6 mics ,and I choose three of them as a triangle array)
however,when I run roslaunch soundloc soundloc.launch,terminal …
-
**Is your feature request related to a problem? Please describe:**
- When GoAlert leaves a voicemail the beginning of the message is cutoff
- The UI doesn't distinguish between a voice call bein…
-
When attempted, throws the error:
"ValueError: File format b'fLaC'... not understood."
-
Hi,
I'm a happy user of faster-whisper-server. I mainly use it as a whisper backend for [open-web-ui](https://github.com/open-webui/open-webui/) and recently opened an issue to share my code for hi…