-
## 論文タイトル(原文まま)
VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design
## 一言でいうと
単一段階のテキスト音声変換モデルであるVITS2は、対向学習とアーキテクチャ設計を用いて、音声の…
-
**例行检查**
[//]: # (方框内删除已有的空格,填 x 号)
+ [ ] 我已确认目前没有类似 issue
+ [ ] 我已确认我已升级到最新版本
+ [ ] 我已完整查看过项目 README,已确定现有版本无法满足需求
+ [ ] 我理解并愿意跟进此 issue,协助测试和提供反馈
+ [ ] 我理解并认可上述内容,并理解项目维护者精力有限,**不遵循规则的 issue…
-
https://github.com/litagin02/Style-Bert-VITS2/blob/bbd89794c68e2ffb351d2ba98c33ca5342aef852/docs/TERMS_OF_USE.md?plain=1#L9-L14
It seems that the above items violate the following section of AGPL v…
-
Bạn có thể hướng đẫn mình cách train chi tiết được không ạ?
Nếu được thì bạn cho mình xin thử pretrained model với!
-
Related: https://github.com/litagin02/Style-Bert-VITS2/issues/99
```
Epoch 35(37%)/1000: 3%|█▊ | 5371/156008 [06:36
-
调用合成函数,字数不多的情况下,延时都能达到3s。Bert-VITS2 v2.3也是差不多。而同样setup的情况下,Bert-VITS2 v2.2延时基本在300ms以内。请问这是因为fish-speech还有Bert-VITS2 v2.3多加了模型的原因吗?
另外有大佬测过fish-speech的streaming模式延时吗?
我是在V100上测的延时。
-
Hi there, I have this error when I want to convert model weight to onnx model.
How can I solve the problem?
Using mel posterior encoder for VITS2
Multi-band iSTFT VITS2
INFO:root:dec.stft.forwar…
-
```
C:\WorkSpace\Style-Bert-VITS2\sbv2\Style-Bert-VITS2\venv\lib\site-packages\pyannote\audio\core\io.py:43: UserWarning: torchaudio._backend.set_audio_backend has been deprecated. With dispatcher …
-
I have created the following PR to add support for Thai language.
I am in the process of creating a dataset to train the model but would love a PR review of the code first to make sure I am on the ri…
-
Running the collab block renders the following error
```
---------------------------------------------------------------------------
TypeError Traceback (most rece…