-
### **Description:**
Integrate a speech-to-text module to handle audio files, converting speech to text for interaction. This feature will allow users to upload audio files and interact with the tran…
-
### Discussed in https://github.com/langchain-ai/langchain/discussions/27404
Originally posted by **kodychik** October 16, 2024
### Checked
- [X] I searched existing ideas and did not find …
-
Currently batch processing attempts to process all files in the given directory, even hidden ones (macOS's `.DS_store` is currently deny listed). Attempting to process a non-audio file causes the tran…
-
### API Spec link
https://github.com/Azure/azure-rest-api-specs/blob/main/specification/cognitiveservices/data-plane/Speech/SpeechToText/preview/2024-05-15-preview/speechtotext.json
### API Spec ver…
-
Is there a way to transcribe multiple audio files asynchronously/parallel with whisperX?
-
### Describe the feature
While using vibe for transcribing many files, there are some US file tunes i would love to have
- Allow selection of nested folders. my files are organized in 3 levels of fo…
-
As of recently, it seems like the whisper large v3 supports batch transcribing (i.e. multiple audio files at once), is this feature available in whisperX with large v3? If so, can someone share a smal…
-
### Describe the bug
Hi, amazing work so far with SpeechBrain!
I am implementing an ASR model focused on the Italian language using SpeechBrain. My first step was to try the pretrained models avai…
-
**Is your feature request related to a problem? Please describe.**
I want to be able to use the whisper-1 model provided by OpenAI to transcribe text from audio. LLMs are also getting more multi-moda…
-
use openlrc version: 1.5.2
When try to transcribe a video that have no human voice, will get exception `RuntimeError: stack expects a non-empty TensorList`.
I found the following text in log:
``…