yl4579 StyleTTS issues - Githubissues

yl4579 / StyleTTS

Official Implementation of StyleTTS

MIT License

385 stars 62 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

Is there a first-stage pretrained model?

#77 cnint0627 opened 2 days ago
0
can't import mask_from_lens from monotonic_align

#76 cnint0627 opened 3 days ago
0
amazing work,can it support that generate the phoneme delayed time sequence?

#75 CasonTsai opened 1 month ago
0
MelSpectrogram() and unspecified sampling rate

#74 dsplog opened 4 months ago
0
Voice Quality issue using Librispeech

#73 Anshu-Kumar-1 opened 6 months ago
0
Marathi Support ?

#72 raushanagrawal opened 7 months ago
0
Is the uv detector trained in the pretrained pitch detector?

#71 auspicious3000 opened 7 months ago
0
Has anyone had this problem when converting to onnx?

#70 bobo-paopao opened 7 months ago
0
S2S

#69 pawanhv closed 8 months ago
1
StyleTTS 2

#68 fakerybakery closed 8 months ago
1
continue correct number of epoch from pretrained

#67 magicse opened 9 months ago
0
Update train_second.py

#66 magicse closed 9 months ago
0
question

#65 magicse closed 9 months ago
2
Pre-training model sound quality issues

#64 gachaun closed 9 months ago
5
train on ESD

#63 bobo-paopao closed 9 months ago
1
Inference exact time for each word

#62 enla51 closed 10 months ago
5
Question: Fine tuning LibriTTS with StyleTTS

#61 Yahya-khodr closed 10 months ago
5
The pronunciations of single words or short words is poor?

#60 GuangChen2016 closed 10 months ago
1
Training Model with new Dataset

#59 Yahya-khodr closed 11 months ago
0
Training the model

#58 Yahya-khodr closed 11 months ago
1
what is the mean=-4 and std=4 meannig?

#57 skysbird closed 11 months ago
1
First stage alignment training failed when TMA_CEloss=True

#56 auspicious3000 closed 11 months ago
2
style encoder inconsistency

#55 auspicious3000 closed 11 months ago
1
batch size and number of epochs for large dataset

#54 auspicious3000 closed 11 months ago
1
多卡训练的问题

#53 bobo-paopao closed 11 months ago
5
About train on Vietnamese Dataset

#52 christopherohit closed 11 months ago
1
hi，如果使用v4000 单卡训练LibriTTS数据集，显卡内存只有16G， batch_size只能设置为8，训练时常大概为多长呢？谢谢

#51 bobo-paopao closed 1 year ago
1
Need help for training

#50 nhanhttrong closed 12 months ago
5
s2s loss 和mono loss在第一阶段的训练中一直为0

#49 bobo-paopao closed 1 year ago
1
how to Inference

#48 luwentao1989 closed 1 year ago
1
Source of Hifigan checkpoints?

#47 kmn1024 closed 1 year ago
1
pretrained model of stage 1

#46 nhanhttrong closed 1 year ago
0
Can you offer the loss log of style tts

#45 hdmjdp closed 1 year ago
1
asr phone dict different from this

#44 hdmjdp closed 1 year ago
1
about ASR model

#43 hdmjdp closed 1 year ago
1
Would you recommend changing the code in the inference notebook when running a PL-BERT finetuned model?

#42 ghost closed 1 year ago
1
error while running train_second.py (caused by size mismatch)

#41 ghost closed 1 year ago
2
Code for Emotional Speech Synthesis

#40 satani99 closed 1 year ago
1
Questions about the Evaluations

#39 Zhongxu-Wang closed 1 year ago
1
turns out your code doesn't join the read wav paths from train_list.txt file with the dataset path (the location of train_list.txt)

#38 ghost closed 1 year ago
0
running train_first.py raises error

#37 ghost closed 1 year ago
3
Any-to-any and emotion examples

#36 sleimanitani closed 1 year ago
6
Phoneme sequence padding

#35 WorkingJack closed 1 year ago
3
Fixing librosa compatibility issue and UTF-8 issue

#34 Artyom17 closed 1 year ago
0
What's r1_reg loss?

#33 splinter21 closed 1 year ago
1
Probleam about data processing

#32 Zhongxu-Wang closed 1 year ago
1
crashes during training

#31 ppisljar closed 1 year ago
4
Why don't use "attention_weight" in train_first.py ?

#30 dy2009 closed 1 year ago
1
Why I can't use your mel to train HiFi-Gan Vocoder ?

#29 dy2009 closed 1 year ago
1
f0_extractor ?

#28 dy2009 closed 1 year ago
0