-
Hi all, it appears that [Open Relay](https://www.metered.ca/tools/openrelay/), which Chitchatter uses to establish peer connections when P2P access is unavailable, is changing such that Chitchatter ma…
-
#### Is this a bug, enhancement, or feature request?
Bug
#### Describe your proposal.
UI elements type for all the UIs should be communicated to the user and all info must be conveyed only on…
-
http://www.speech.cs.cmu.edu/haitian/speech
haitian creole from CMU
from linkedin "In 2010, we released both the text data & speech models. Everyone was more focused on the text data to create M…
-
### Project Name
📄 Query PDF (Enhancing Accesibility For All Users)
### Description
# 📄 Query PDF (Enhancing Accesibility For All Users)
## Solution Overview:
**Query PDF** is a voice-powered…
-
We currently have about 7500 hours of oral argument audio without transcriptions. We need to go through these audio files and run a speech to text tool on them. This would have massive benefits:
- Ale…
-
This needs some research/discussion:
The primary issue with finding a suitable speech to text api is that the majority require that the user interact with them via a web browser that uses the [Web …
-
Hello, I'm trying to use the speech-to-transcript alignment on large audio files (6+ hours duration).
I'm getting the error below. I tried with two different files (one almost 6h, the other 9.1h).
…
-
### 🚀 The feature, motivation and pitch
As we all know, GPT-4o is an end2end multi-modal models, which support Speech to Text/Speech. I have some ideas about it:
1. Speech to Text: Can we have a t…
-
Hi,
My name is Hoang Nguyen, I've been using this plugin to add the speech-to-text feature to my application for a while and it's still working great for demos. However, now my app will be a commer…
-
I'd like to raise a concern about how quantization is currently handled in SpeechBrain. While training my own k-means quantizer on the last layer of an ASR model, I noticed that the interface was not …