jaywalnut310 vits issues

jaywalnut310 / vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

https://jaywalnut310.github.io/vits-demo/index.html

MIT License

6.91k stars 1.27k forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

Is it normal to wait a long time between each epoch？

#122 HuaHuaOfficial opened 1 year ago
4
Inference result is not as good as the demo

#121 ali-elkahky opened 1 year ago
5
FP32 training and alternative models

#120 NikitaKononov opened 1 year ago
0
Use config boundaries instead of hardcoded

#119 nikich340 opened 1 year ago
0
Reduce RAM usage: remove excess workers

#118 nikich340 opened 1 year ago
3
Not all .wav files generate corresponding spec.pt

#117 anitman opened 1 year ago
2
[Question] How many iterations for the available pretrained model?

#116 shivammehta25 opened 1 year ago
3
colab进行Multi Speaker测试时，出现错误

#115 sun-rabbit opened 1 year ago
1
colab上无法运行测试

#114 sun-rabbit opened 1 year ago
0
Sorry I have a Question about RuntimeError

#113 mao-mao-yu closed 1 year ago
2
Question about VITS KL Loss Formula

#112 MMingabc opened 1 year ago
1
How can we keep the output the same when the input is the same？

#111 Leo4zhou closed 1 year ago
1
Ea 45 setup cpu inference

#110 arnasRad closed 1 year ago
0
VCTK Multi Speaker Models Training Results (How many steps required for results like Pretrained)?

#109 athenasaurav closed 1 year ago
5
Whether kl loss is negative will affect the model convergence?

#108 980202006 opened 1 year ago
0
When I train multispeaker get RuntimeError: cuDNN error: CUDNN_STATUS_INTERNAL_ERROR

#107 cliuxinxin closed 1 year ago
2
Problems about training with multiprocess

#106 BrianWayland opened 1 year ago
1
How to finetune the given pre-trained model?

#105 apzl opened 2 years ago
6
what is difference between val.text and train.txt

#104 queechy opened 2 years ago
0
CUDA ERROR!

#103 NK990 opened 2 years ago
1
Suggestion on Over-fitting?

#102 Ahmer-444 opened 2 years ago
0
How to conitnue trainig?

#101 NK990 closed 2 years ago
1
Fix typo in modules.py

#100 eltociear opened 2 years ago
0
What is the relative position method

#99 TheHonestBob opened 2 years ago
0
vits is awsome! Can vits train with emotional dataset?

#98 akfheaven opened 2 years ago
2
你好，想拿来试试运行在colab，但是不成功

#97 upright2003 closed 1 year ago
2
torch.nn.modules.module.ModuleAttributeError: 'SynthesizerTrn' object has no attribute 'emb_g' 【anybody help me，please！】

#96 lunar333 opened 2 years ago
1
control and change the energy and pitch by manipulating the latent representation

#95 tuannvhust opened 2 years ago
1
没人反馈这个语音API目前失效了吗

#94 xingyunlee12138 opened 2 years ago
0
没人反馈这个语音API目前失效了吗....

#93 xingyunlee12138 closed 2 years ago
1
Questions about the Korean datasets

#92 ORI-Muchim closed 1 year ago
3
Question about strange voice

#91 panxin801 opened 2 years ago
0
Error on tensor size & How to config with 16KHz waves?

#90 voxServalG opened 2 years ago
2
Is there any method which can adjust the speed of synthesized speech?

#89 guofengjpggwk opened 2 years ago
1
size mismatch for enc_p.emb.weight

#88 kdrkdrkdr closed 2 years ago
1
How many steps should we train to get the best results?

#87 futureaiengineeer opened 2 years ago
12
I want to train VITS model for URDU language

#86 Muhammad-Afnan-Akram opened 2 years ago
0
Mispronounce some words and 44,1 Khz audio

#85 tuannvhust opened 2 years ago
2
Can I use it without GPU?

#84 itsdapi opened 2 years ago
3
all loss keeps almost the same during training when using VITS to train multi-lingual datasets

#83 zhufeijuanjuan opened 2 years ago
0
Can anyone explain me what the boundaries are for?

#82 tuannvhust closed 2 years ago
2
Questions about 48k audio file train

#81 H4ppyB1rd opened 2 years ago
6
Dev cython ext

#80 payonear closed 2 years ago
0
有辦法做到單行指令轉換嗎?

#79 upright2003 opened 2 years ago
0
Something wrong hapeened to my interpreter?RuntimeError

#78 redmist328 closed 10 months ago
3
RuntimeError: The expanded size of the tensor (8192) must match the existing size (448) at non-singleton dimension 1. Target sizes: [1, 8192]. Tensor sizes: [448]

#77 zgfjzs opened 2 years ago
0
RuntimeError: view_as_complex is only supported for float and double tensors, but got a tensor of scalar type: Half

#76 shoang22 closed 2 years ago
3
RuntimeError During Training

#75 QuellaMC closed 2 years ago
1
Training error

#74 Eternity231 closed 2 years ago
6
AssertionError: 4D tensors expect 4 values for padding

#73 kagura114 closed 2 years ago
3

Previous Next