-
I was looking over the implementation of your text-to-speech model. Is it working, and if so, where can I test it?
Thanks in advance.
-
https://github.com/tuanh123789/AdaSpeech/blob/64f15c4b3fa4590267f12930d7aaf411a1b36d1e/preprocessor/preprocessor.py#L336
-
Hi, I'm trying to train the AdaSpeech model from your project. However, when I train the pretrained model, I get a NaN loss during training.
Is there a way to fix this in your code?
![image](https://us…
-
Hi, I have followed your work for several months and am pleasantly surprised at how quickly you keep up with new algorithms.
For AdaSpeech, have you verified that the two acoustic encoders really help…
-
Hi, I want to know what "reference_audio" is used for during inference.
-
Hello rishikksh20, thanks for your contribution!
I found a problem when training with this code.
At line 415 of fastspeech.py:
```
if avg_mel is not None:
    avg_mel = avg_mel.unsqueeze(0)
# …
-
I'd like to ask: does the SpeakerAdapter module make much of a difference, especially during fine-tuning?
-
I have trained a VITS model, and when I apply LoRA to the attention layers, fine-tuning does not work properly. Could you please tell me which layers you applied LoRA to when fine-tuning the VITS model, and what…
-
Two papers on Transformer TTS have appeared recently, and I think both are suitable for this repo:
1) [DelightfulTTS: The Microsoft Speech Synthesis System for Blizzard Challenge 2021](https://arx…
-
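Thanks to the author for releasing this VITS code for Chinese. I trained a model on the Biaobei (标贝) female-voice dataset plus data from one target speaker (about 30 minutes of audio), running 480k steps with a batch size of 32. For both the Biaobei female voice and the custom target speaker, the results are mediocre: the speech does not sound very natural, and some pronunciations are inaccurate. Could the author share the training details, for example the batch size used and roughly how many training steps it takes before synthesis quality becomes good?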