-
使用Instruct进行推理,希望固定音色输出音频,但现状会有所偏移,有概率出现男女混合的音频。
1. 使用Instruct模型推理,没有embedding,如何指定音色呢?
2. 若通过prompt_text描述音色,应该如何描述自定义的音色?
-
hi i got this error when i run your tutorial code
```
Traceback (most recent call last):
File "D:\workspace\AIs\MyTTS.py", line 34, in
tts.tts_to_file(".زندگی فقط یک بار است؛ از آن به خوب…
-
**Can anyone help me please**
(venv) C:\Users\Dragn\Documents\WeeaBlind-master\WeeaBlind-master\venv>python weeablind.py
C:\Users\Dragn\Documents\WeeaBlind-master\WeeaBlind-master\venv\output\sa…
-
Hi,
I've tried creating an agent using an openAI Assistant as the LLM. It joins the room and works as expected until after the it's first utterance. After speaking the string I pass into the agent.…
-
### Describe the bug
I managed to fine-tuning vctk-vits using language indonesia.
I wanna convert best_model.pth using vits.export_onnx
this is my [config.json](https://gist.github.com/sofianhw…
-
### Describe the bug
Trying to experiment a little bit with running multiple TTS instances at the same time using the docker image, I created 5 different containers, and trying to execute a TTS comma…
-
I have ollama and miniconda in i7 7gen, 1070gtx, 16gb ram.
I change configs yaml like medium.en for medium (because i wan speak in spanish), and change sites "en" for "es". But for in local can hav…
-
![image](https://github.com/user-attachments/assets/8cbd2865-c98b-442c-9f52-a35c7805dcb8)
I have installed all requirements
-
## 在昇腾910b上,使用paddleocr读取表格,耗时达到6s左右, n卡只需0.8s
### 物理环境: cann80RC1-ubuntu20-paddleocr2.7.3-paddlepaddle(3.0.0.dev20240527)
#### 环境变量:
`ENV FLAGS_npu_jit_compile=False
ENV FLAGS_npu_scale_aclnn=T…
-
### Describe the bug
I was testing the Bengali Voice model and it missed the Bengali number pronunciation. Bengali numbers
০ ১ ২ ৩ ৪ ৫ ৬ ৭ ৮ ৯
0 1 2 3 4 5 6 7 8 9.
১৯৫৪ সাল। কালো রাত। Here is su…