-
### The Feature
[Chirp](https://console.cloud.google.com/vertex-ai/publishers/google/model-garden/chirp-rnnt1) is a speech to text model, similar to `whisper`
Ideally it could be supported via the…
-
I'm trying to use the voice 'en-US-AnaNeural' in the [tts-text-stream sample](https://github.com/Azure-Samples/cognitive-services-speech-sdk/blob/master/samples/python/tts-text-stream/text_stream_samp…
-
### What happened?
On the API docs, I found https://learn.microsoft.com/en-au/azure/ai-services/speech-service/how-to-lower-speech-synthesis-latency?pivots=programming-language-python#text-streamin…
-
```
Faster-Whisper-XXL.exe --model=large-v2 --language=en --output_dir "C:\MP3\TV & MOVIES & GAMES\TRIBUTES\1997 - TV Terror - Felching A Dead Horse" --output_format srt --vad_filter True --max_l…
-
Hello,
First of all thanks for developing this tool and making it available ! I'm trying to use crisperwhisper to annotate a naturalistic language production experiment in german. The files are 1mn…
-
**Project description**
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML models. It supports text generation, image generation, vision-language models (VLM), auto-speech-recognition…
-
"The effect of Whisper in Chinese speech recognition is very poor, with almost all recognitions being incorrect. I hope to add support for sherpa-onnx-sense-voice-zh-en-ja-ko-yue-2024-07-17."
This …
-
When I start the gradio UI the top dropdown is empty and when I click it I get the error
```
2024-09-19 18:09:56 | INFO | gradio_web_server | Models: []
2024-09-19 18:09:56 | ERROR | stderr | D:\…
-
### System Info
transformers.js: 3.0.2
chrome: 130
OS: macos
### Environment/Platform
- [X] Website/web-app
- [ ] Browser extension
- [ ] Server-side (e.g., Node.js, Deno, Bun)
- [ ] Des…
-
use openlrc version: 1.5.2
When try to transcribe a video that have no human voice, will get exception `RuntimeError: stack expects a non-empty TensorList`.
I found the following text in log:
``…