speech-processing Search Results

GasimV/Commercial_Projects #2

Speech Processing Models

`torchaudio` is an extension library for PyTorch, designed to facilitate audio processing using the same PyTorch paradigms familiar to users of its tensor library. It provides powerful tools for audio…

GasimV updated 4 months ago

ronylpatil/youtube-comment-analyzer #7

🐞Bug Fixing & Enhancements

### **_Bugs to be fixed_** - [x] _While scrolling every time comments iteration begins from the first comment, there's some delay in processing new comments._ ### **_Enhancements_** - [ ] _try out wo…

ronylpatil updated 5 days ago

nvaccess/nvda #12778

Split speech processing commands and commands for synth

### Is your feature request related to a problem? Please describe. - Speech commands for synthesizers and speech commands for internal speech processing are not well separated. - The `SpeechManage…

feerrenrut updated 2 days ago

fluttergems/awesome-open-source-flutter-apps #672

Add FlutterVoiceFriend in Generative AI & LLMs

FlutterVoiceFriend is an open-source Flutter application designed to help developers build interactive, voice-driven chatbot experiences using a combination of speech-to-text (STT) and text-to-speech …

jbpassot updated 3 weeks ago

mediar-ai/screenpipe #585

[feature] adaptive noise reduction

Here are 10 approaches to implement adaptive noise reduction, ordered by complexity/effectiveness: ### 1. Enhanced Spectral Subtraction - Track noise floor during silence periods - Use overlappin…

louis030195 updated 1 month ago

livekit/agents #757

Implement Manual VAD Commit via Button for Controlled Speech…

I've implemented a button in the client that is supposed to ensure VAD (Voice Activity Detection) doesn't immediately commit my conversation and send it to the server. Instead, it should wait until I …

ChrisFeldmeier updated 1 month ago

OFA-Sys/AIR-Bench #6

An error about the audio file

I noticed that the file “SNG1312_85.3_109.69.wav” in your dataset on Hugging Face appears to be empty. The file path is “Chat/speech_dialogue_QA_spokenwoz/SNG1312_85.3_109.69.wav.” Could you please ch…

wangwen-banban updated 1 week ago

virtual-labs-archive/speech-signal-processing-iiith-old #83

speech-signal-processing_speech-production-mechanism

**Description:** At given address, hyperlinks for home, all labs, contact, partner, logo and computer science & engineering are not working. **Steps to reproduce the issue:** 1)Open vlabs website th…

Rajadattu updated 5 years ago

tensorflow/datasets #5377

Error when processing speech_commands dataset

/!\ PLEASE INCLUDE THE FULL STACKTRACE AND CODE SNIPPET **Short description** An error occurs when processing the speech_commands dataset. **Environment information** * Operating System: mac0S…

guillaumelorre28 updated 4 months ago

TeamAudio/reaspeech #113

[bug]: Low FPS with large transcript tables

### Operating System Windows ### Other Operating System _No response_ ### Architecture amd64 ### ReaSpeech Image reaspeech (CPU) ### What were you trying to do? Processing large audio files w…

ramen updated 4 days ago

1000+ results for speech-processing

1000+ results
for speech-processing