-
Automatic Speech Recognition or ASR, as it's known in short, is the technology that allows human beings to use their voices to speak with a computer interface in a way that, in its most sophisticated …
-
# Automatic Speech Recognition
ASR is a text generation task that convert spoken language into its written form.
## Task Objective
Automatic Speech Recognition (ASR) is a classic task in the…
-
Is there any other work around to improving default automatic speech recognition?.
Default speech recognition not working properly for the local english slang.
-
### Environment
🪟 Windows
### System
w11
### Version
latest
### Desktop Information
_No response_
### Describe the problem
when i select whisper Local on the web ui speech r…
-
### System Info
- `transformers` version: 4.42.2
- Platform: Windows-10-10.0.22621-SP0
- Python version: 3.10.14
- Huggingface_hub version: 0.23.4
- Safetensors version: 0.4.2
- Accelerate ver…
-
I am using Swift pod 'MicrosoftCognitiveServicesSpeech-iOS', '~> 1.25' for continuous speech recognition. I want to implement a feature where the recognition automatically stops if the user doesn't sp…
-
I would love it if the script could automatically scroll based on audio input that is parsed via speech recognition software. I found a list here: https://fosspost.org/lists/open-source-speech-recogni…
-
Hello(s) Dear, @f4str , @GiulioZizzo , @beat-buesser ! is it possible to dynamically parameterize the face of the classifier *HuggingFaceClassifierPyTorch* otherwise, it doesn't seem as dynamic as …
-
```
python run_gpu.py "openai/whisper-medium" "whisper-medium-onnx-int4-inc" "ukrai
nian_speech.wav"
You are using a model of type whisper to instantiate a model of type . This is not supported for…
-
### System Info
xenova/transformers.js#v3
### Environment/Platform
- [ ] Website/web-app
- [ ] Browser extension
- [ ] Server-side (e.g., Node.js, Deno, Bun)
- [ ] Desktop app (e.g., Elect…