speech-activity-detection Search Results

855 results
for speech-activity-detection

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

mitll/pyslgr #1

Training model in pySLGR

Hi, I am trying to use pySLGR to develop a audio to text transcription system. After going through the example programs and documentation, I was wondering does pySLGR provide methods only for feature …

vorugant updated 5 years ago
7
m-bain/whisperX #536

Audio Transcription Issue with WhisperX Compared to Standard…

# Context I have been working on a personal project for several weeks that utilizes WhisperX for speech recognition. Recently, I decided to experiment with a custom Voice Activity Detection (VAD) sys…

davidlandais updated 11 months ago
3
OpenNMT/CTranslate2 #1648

Whisper batch generation is not faster than loops

In CTranslate2 Whisper model, batch generate is not faster than looping one by one. I tried the same thing on Translator model and it shows batching is far superior (a lot faster). I used Whisper smal…

evanarlian updated 7 months ago
5
kaegi/alass #7

Troubleshooting wrong alignment

I was wondering how the language-agnostic part works, since on my first few quick tests, it generated a totally wrong output for Dutch subtitles, but a perfect one for English subs. The dutch output …

davidde updated 5 years ago
6
yyf17/awesome-embodied-intelligent #1

SoundSpace

# [sound-spaces](https://github.com/facebookresearch/sound-spaces) [Project: RLR-Audio-Propagation](https://github.com/facebookresearch/rlr-audio-propagation) [Audio Sensor](https://github.com/f…

yyf17 updated 2 years ago
1
JuliaDSP/Roadmap #3

Speech.jl?

I would like to start drafting a new package for speech signal processing, focused mainly on speech feature extraction (MFCCs, LPCs, fundamental frequency, etc). @davidavdav has a lot of work on MFCCs…

jfsantos updated 6 years ago
8
balkce/soundloc #1

a question about this project

Hi,I am interested in this project, (I use jack as input but with an array with 6 mics ,and I choose three of them as a triangle array) however,when I run roslaunch soundloc soundloc.launch,terminal …

severusbunny updated 5 years ago
1
target/goalert #921

voice: voicemail/answering machine detection

**Is your feature request related to a problem? Please describe:** - When GoAlert leaves a voicemail the beginning of the message is cutoff - The UI doesn't distinguish between a voice call bein…

mastercactapus updated 1 year ago
6
marsbroshok/VAD-python #11

Does not seem to work with FLAC files

When attempted, throws the error: "ValueError: File format b'fLaC'... not understood."

karkirowle updated 5 years ago
3
fedirz/faster-whisper-server #108

Feature Request + my code: audio cleanup

Hi, I'm a happy user of faster-whisper-server. I mainly use it as a whisper backend for [open-web-ui](https://github.com/open-webui/open-webui/) and recently opened an issue to share my code for hi…

thiswillbeyourgithub updated 1 month ago
7

上一页 1...5 6 7 8 9 10 11...86 下一页

855 results for speech-activity-detection

855 results
for speech-activity-detection