-
Hi moshi team,
Thank you for the great work on moshi! Would it be possible to open-source the Helium base model? As a single-language large model, it has significant potential for fine-tuning on sp…
-
**Describe the bug**
When I load an advanced model in the Java SDK, Keyword Recognition does not work at all: the keyword is **never** recognized. It will, however, work perfectly fine with a basic mo…
-
**Project description**
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML models. It supports text generation, image generation, vision-language models (VLM), auto-speech-recognition…
-
### Description
The goal is to develop a Tibetan text-to-speech (TTS) model that can convert Tibetan text into Tibetan speech. This project involves training a TTS model using filtered good audio qual…
-
I am following the steps in the readme.md document to install the environment on my Windows computer. When I execute
```
python -m omni_speech.serve.model_worker --host 0.0.0.0 --controller http://…
-
When I start the Gradio UI, the top dropdown is empty, and when I click it I get the error:
```
2024-09-19 18:09:56 | INFO | gradio_web_server | Models: []
2024-09-19 18:09:56 | ERROR | stderr | D:\…
-
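With a 4080 GPU, the speed is probably less than 1% of what it was before, comparable to running on the CPU. Yet GPU utilization shows as fully maxed out, and I cannot find the cause. Could it be that the PyTorch and TensorFlow bundled in the package are not being called correctly?
The fasterwhispergui.log is as follows: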
None of PyTorch, TensorFlow >= 2.0, or Flax have been found. Models won't be available and …
-
Using openlrc version 1.5.2.
When I try to transcribe a video that has no human voice, I get the exception `RuntimeError: stack expects a non-empty TensorList`.
I found the following text in the log:
``…
-
### Question Validation
- [X] I have searched both the documentation and discord for an answer.
### Question
Is there a way that I can use this package for realtime translation with live subtitle w…
-
### **Bug Description**
This issue occurs when using the text_stream_sample with the zh-CN-YunxiaNeural voice model, resulting in unintended pauses between words, which disrupts the natural flow of…