voice-activity-detection Search Results

1000+ results
for voice-activity-detection

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

m-bain/whisperX #764

Incomplete transcription of Non-English audios

I am running WhisperX with large-v3 model. **When an audio is given, the transcription output ignores last 7-8 seconds and gives smaller transcript than the original answer.** - I evaluated the …

aayushNB updated 3 months ago
3
biggestT/toney #9

Smarter detection of non-speech sample

Currently there is only a threshold value in frequency spectrun variance that decides if a sample is noise or speech. Options to look into: - Noise gate: http://en.wikipedia.org/wiki/Noise_gate - Slid…

biggestT updated 10 years ago
1
rifflearning/zenhub #121

SPIKE: Speech Recognition (signal processing) Validation

As a Riff Developer, I am not confident that our speech detection is working correctly, based on the code that I've seen. Specifically, I'm concerned that we are not properly detecting actual speech v…

adonahue updated 4 years ago
10
zj1123581321/Adjust_SubTitle #1

中文幻觉问题

大佬，我在用whisper推理我业务数据的时候，经常出现连续很长的字或词的问题，有什么好的解决办法吗

wntg updated 6 months ago
1
kyungyunlee/ismir2018-revisiting-svd #2

Doubt: Double Stage HPSS calculated over first P component

I've been thinking a lot about this code fragment in https://github.com/kyungyunlee/ismir2018-revisiting-svd/blob/master/leglaive_lstm/audio_processor.py in function process_single_audio (Compute d…

Vichoko updated 4 years ago
1
ggerganov/whisper.cpp #1149

Andoid mic detects low decibel sound and trigger vad repeate…

First of all, I thank you Georgi Gerganov and all who contributed to this project. I have a progressive neuro-muscular disease and I almost can not use my hands. I bought a new android mobile to ease…

trappedinspacetime updated 2 months ago
8
modelscope/FunASR #1172

uniASR模型推理速度慢

运行环境：操作系统：linux python：3.8.16 modelscope:1.9.4 funasr: 0.8.4 gpu:T4 cuda:11.6 代码 ``` from modelscope.pipelines import pipeline from modelscope.utils.constant import Tasks from modelscope.…

MyWestCity updated 1 month ago
4
flashlight/wav2letter #851

How to get timestamp of each word ?

I would like to know is there any possible way we can get the timestamp of each word using wave2letter architecture? If so, how should we do it, please let me know regarding the same

manojmsrit updated 3 years ago
3
aikuma/aikuma #295

Speaking rate estimation and feedback

When asking a group of people produce clear speech, it's typical to observe wide variations such as how slow and 'clear' it is. Consultants often drift towards conversational speech rates to such a de…

Lingomat updated 9 years ago
1
amsehili/auditok #32

Quality Benchmarks Between audiotok / webrtcvad / silero-vad

Here I will post our benchmarks comparing these three instruments

snakers4 updated 3 years ago
3

上一页 1...5 6 7 8 9 10 11...100 下一页

1000+ results for voice-activity-detection

1000+ results
for voice-activity-detection