-
# Speech Summarization
Speech summarization refers to the process of condensing spoken language into a shorter version while retaining its essential meaning and key points. Speech summarization aim…
-
# Task Name
African American Vernacular English (AAVE) Speech Recognition
## Task Objective
Mainstream speech recognition systems often perform poorly on non-standard dialects and sociolects,…
-
Although the title implies code modifications, I didn't make significant changes compared to the previous version since it is already functional. I do have some additions in mind, but they'll only be …
-
### Ticket Contents
## Description
Bhashini provides APIs for products to perform Automated speech recognition (ASR). These APIs use models hosted by Bhashini in the cloud and can be simply integr…
-
Hi! !مرحبا! السلام عليكم
Let's bring the documentation to all the Arabic-speaking community 🌏 (currently 0 out of 267 complete)
Would you want to translate? Please follow the 🤗 [TRANSLATING guid…
-
Hello,
we are a bit confused about the function of the `--language` flag. Does it
- restrict the transcription to the specified language
or
- translate whatever language it recognizes to the sp…
-
Looks like google is making an update to the speech to text api.
---
Dear Speech-to-Text user,
We’re writing to let you know about the changes coming to Google Cloud Speech-to-Text API. We’ll m…
-
#### Describe the bug
Currently, offline Speech Recognition only recognizes US English and more languages need to be supported.
#### Expected behavior
Configure more models from here https://so…
-
## ❓
NEED HELP/FIX ASAP
already logged issues.
#3683 : https://github.com/facebookresearch/fairseq/issues/3683
AssertionError: Could not infer task type from {'_name': 'temp_sampled_audio_pret…
-
Type: Bug
Once Claude Sonnet is selected in the Pick Model option within Chat, it goes back to gpt4o after I restart my computer or reload my session.
VS Code version: Code - Insiders 1.96.0-insid…