-
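Hello author, I followed the tutorial, ran everything myself, and synthesized the speech I wanted. I have a few questions about the technical details and hope you can answer them!
1. The Bilibili tutorial says at 13:27: "use hubert and RVQ to convert the audio into semantic tokens that contain timbre". Does "semantic" here refer to the semantics of the reference speech provided by the target-timbre speaker, or to the semantics of the target speech content to be synthesized?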
![image](https://github.com/RVC-Boss/GPT-SoVITS…
-
Hi @p0p4k, thanks for making this repo!
I am currently trying to train a 44.1kHz English model, but my model is struggling with a rather high duration loss when compared against your TensorBoard lo…
-
I am trying to convert the model to a torch TensorRT pre-compiled model for inference on an embedded device. I am using this script:
```
import torch_tensorrt
import torch
model = torch.hub.lo…
-
![image](https://github.com/user-attachments/assets/a33a769f-88f9-4d53-bdcc-207eeac70cb9)
Config file code:
std::string vits_model = "./vits-melo-tts-zh_en/model.onnx";
std::string vits_lexicon = "./vits-m…
-
Please see https://github.com/k2-fsa/sherpa-onnx/discussions/890
@yaambe and @thewh1teagle have managed to convert MMS models to sherpa-onnx
There are plenty of MMS tts models from https://huggi…
-
![image](https://github.com/user-attachments/assets/9b971c68-96db-4a16-b33d-425bebee18f0)
I don't know why the generated image looks like this; I just downloaded the repo and the vitl parameters. Using example folders…
-
During training, what loss value did the model reach to produce the results you reported in your paper? When I trained on a small dataset of 2 hours of Vietnamese audio for up to 1000 epochs, my…
-
### 🐛 Describe the bug
My code is listed below:
######################################################################
import torch
import torch.nn as nn
from torch.ao.quantization.observer import d…
-
**Describe the bug**
A clear and concise description of what the bug is.
**Basic environments:**
- OS information: Linux 4.15.0-176-generic #185-Ubuntu SMP Tue Mar 29 17:40:04 UTC 2022 x86_64
…
-
#20 did not take effect. Judging from the commit history, the `if` in 0dfc12a0110d4fe1b935aaafd1c20dcca387bead is always False, and there is no corresponding configuration option.