-
The title of the paper https://arxiv.org/pdf/2410.15608 is
> Moonshine: Speech Recognition for Live Transcription and Voice Commands
However, the model is a non-streaming model. Could you describe…
-
### Is there an existing issue for this?
- [X] I have searched the existing issues
### Feature Description
Integrate a Voice-to-Text feature into the Chat Bot using the Web Speech API to improve u…
-
# Repetitive Transcription During Silence or White Noise Periods
## Description
Running Version 0.5.6 (20241017.030846)
Apple M3 Max MacBook Pro
14-inch, Nov 2023
Memory: 128 GB
macOS Sequ…
-
### The Feature
[Chirp](https://console.cloud.google.com/vertex-ai/publishers/google/model-garden/chirp-rnnt1) is a speech-to-text model, similar to `whisper`.
Ideally it could be supported via the…
-
> the turbo model is an optimized version of large-v3 that offers faster transcription speed with a minimal degradation in accuracy.
[openai/whisper: Robust Speech Recognition via Large-Scale Weak …
-
How we record and transcribe now:
1. Record a 30 s chunk of audio on each device.
2. Use a local voice activity detection (VAD) model to extract speech frames; if there is not enough speech, skip transcription.
3. transcrib…
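
The gating in steps 1 and 2 can be sketched roughly as follows. This is only an illustration, not the project's actual code: a simple RMS energy threshold stands in for the local VAD model, and the names `extract_speech_frames` and `should_transcribe` as well as the `frame_ms`, `threshold`, and `min_speech_s` values are assumptions chosen for the example.

```python
import math
from typing import List, Sequence


def split_frames(samples: Sequence[float], frame_len: int) -> List[Sequence[float]]:
    """Split samples into consecutive non-overlapping frames, dropping a short tail."""
    count = len(samples) // frame_len
    return [samples[i * frame_len:(i + 1) * frame_len] for i in range(count)]


def rms(frame: Sequence[float]) -> float:
    """Root-mean-square energy of one frame."""
    return math.sqrt(sum(x * x for x in frame) / len(frame))


def extract_speech_frames(
    samples: Sequence[float],
    sample_rate: int,
    frame_ms: int = 30,        # assumed frame size
    threshold: float = 0.02,   # assumed energy threshold (stand-in for a VAD model)
) -> List[Sequence[float]]:
    """Keep only frames whose energy suggests speech."""
    frame_len = int(sample_rate * frame_ms / 1000)
    return [f for f in split_frames(samples, frame_len) if rms(f) >= threshold]


def should_transcribe(
    samples: Sequence[float],
    sample_rate: int,
    min_speech_s: float = 1.0,  # assumed minimum amount of speech per chunk
    frame_ms: int = 30,
    threshold: float = 0.02,
) -> bool:
    """Return True when the chunk holds enough speech to be worth transcribing."""
    frames = extract_speech_frames(samples, sample_rate, frame_ms, threshold)
    speech_seconds = len(frames) * frame_ms / 1000
    return speech_seconds >= min_speech_s
```

A chunk that fails `should_transcribe` would simply be dropped before it reaches the speech-to-text step, which avoids spending compute on silence.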
-
Add audio transcriptions to all episodes
* Ask HZ for written-out texts --> not available
* Generate texts via [Whisper AI](https://openai.com/index/whisper/) or similar speech-to-text software
-
Hi there, thanks for porting this to Cog/Replicate.
Would you be willing to add auto-transcription via Whisper? See this demo on Hugging Face, where it has already been implemented:
https://huggi…
-
I have tried all possible settings for models, sample rate, and channels, but I cannot get recognized speech from VOSK, only empty strings. I have tried the same sample on free speech reco…