-
StyleTTS's reference mel-spectrogram takes a single audio clip as input, so the resulting style vector may closely match the reference wav but differ somewhat from other recordings of the same speaker. May I ask if t…
-
I went through a few of your answers in the issues and would like to know:
1) Do your suggested modifications for the VITS and FastSpeech 2 models apply only to inference, or to fine-tuning as well?
2) in the …
-
a) When loading a checkpoint for train_second.py, I found that for every net apart from mpd, msd and wd I had to add the prefix "module." to make the key names compatible. Is this expected?
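
For context, a minimal sketch of the key-name mismatch described above. `torch.nn.DataParallel` keeps the wrapped network under an attribute named `module`, so its `state_dict` keys gain a `module.` prefix. Whether that is what happens in train_second.py is an assumption here, not something the issue confirms. The helper names and the example keys are hypothetical, and plain dicts stand in for real state dicts so the snippet runs without PyTorch.

```python
from collections import OrderedDict

# Prefix that torch.nn.DataParallel adds to state_dict keys.
PREFIX = "module."


def add_module_prefix(state_dict):
    """Return a copy whose keys all start with 'module.' (already-prefixed keys are kept)."""
    return OrderedDict(
        (key if key.startswith(PREFIX) else PREFIX + key, value)
        for key, value in state_dict.items()
    )


def strip_module_prefix(state_dict):
    """Return a copy with a leading 'module.' removed from every key that has one."""
    return OrderedDict(
        (key[len(PREFIX):] if key.startswith(PREFIX) else key, value)
        for key, value in state_dict.items()
    )


# Hypothetical checkpoint keys, for illustration only.
saved = {"encoder.weight": 1, "encoder.bias": 2}
wrapped = add_module_prefix(saved)
print(list(wrapped))
```

Which direction to convert depends on whether the model receiving the weights is wrapped: a wrapped model expects prefixed keys, an unwrapped one expects bare keys.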
```
diff --…
```
-
Hello!
Thank you, you have done incredible and very useful work for the community.
I would like to train PL-BERT for Slavic languages: Polish, Russian, and Ukrainian.
I don't really understand h…
-
Thank you for your work! Is there any ETA on when the training and inference code will become available?
-
I'm trying to fine-tune on a custom dataset.
Everything starts normally, but after some steps the script dies with:
> RuntimeError: Calculated padded input size per channel: (5 x 4). Kernel size: (5 x 5). …
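
PyTorch raises this error when, in some spatial dimension, the padded input to a convolution is smaller than the kernel. The sketch below only reproduces that size check in plain Python so that candidate input sizes can be tested by hand. The function names are hypothetical, and which layer or which input in the training script triggers the error is not known from the report.

```python
def conv_fits(input_size, kernel_size, padding=0, dilation=1):
    """True if the padded input is at least as large as the (dilated) kernel in one dimension."""
    padded = input_size + 2 * padding
    effective_kernel = dilation * (kernel_size - 1) + 1
    return padded >= effective_kernel


def check_conv2d(input_hw, kernel_hw, padding_hw=(0, 0)):
    """Apply the one-dimensional check to height and width."""
    return all(
        conv_fits(size, kernel, pad)
        for size, kernel, pad in zip(input_hw, kernel_hw, padding_hw)
    )


# The sizes in the error message are already padded, so no extra padding is passed.
print(check_conv2d((5, 4), (5, 5)))  # False: width 4 is smaller than kernel width 5
print(check_conv2d((5, 5), (5, 5)))  # True
```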
-
Hello. Just discovered this.
Is there a way to set the language? Maybe by changing the speaker?
I'd like to read a Spanish EPUB.
-
Great work, thanks.
-
### News
- Conferences
- CVPR 2022: June 19-24 (New Orleans)
- AI investment led by large corporations (SK, LG, KT, etc.)
- [Scatter Lab to develop an AI ethics checklist with the government](https://n.news.naver.com/mnews/article/092/0002259047?sid=105)
### …