-
`torchaudio` is an extension library for PyTorch, designed to facilitate audio processing using the same PyTorch paradigms familiar to users of its tensor library. It provides powerful tools for audio…
-
### **_Bugs to be fixed_**
- [x] _While scrolling every time comments iteration begins from the first comment, there's some delay in processing new comments._
### **_Enhancements_**
- [ ] _try out wo…
-
### Is your feature request related to a problem? Please describe.
- Speech commands for synthesizers and speech commands for internal speech processing are not well separated.
- The `SpeechManage…
-
FlutterVoiceFriend is an open-source Flutter application designed to help developers build interactive, voice-driven chatbot experiences using a combination of speech-to-text (STT) and text-to-speech …
-
Here are 10 approaches to implement adaptive noise reduction, ordered by complexity/effectiveness:
### 1. Enhanced Spectral Subtraction
- Track noise floor during silence periods
- Use overlappin…
-
I've implemented a button in the client that is supposed to ensure VAD (Voice Activity Detection) doesn't immediately commit my conversation and send it to the server. Instead, it should wait until I …
-
I noticed that the file “SNG1312_85.3_109.69.wav” in your dataset on Hugging Face appears to be empty. The file path is “Chat/speech_dialogue_QA_spokenwoz/SNG1312_85.3_109.69.wav.” Could you please ch…
-
**Description:** At given address, hyperlinks for home, all labs, contact, partner, logo and computer science & engineering are not working.
**Steps to reproduce the issue:**
1)Open vlabs website th…
-
/!\ PLEASE INCLUDE THE FULL STACKTRACE AND CODE SNIPPET
**Short description**
An error occurs when processing the speech_commands dataset.
**Environment information**
* Operating System: mac0S…
-
### Operating System
Windows
### Other Operating System
_No response_
### Architecture
amd64
### ReaSpeech Image
reaspeech (CPU)
### What were you trying to do?
Processing large audio files w…