-
track change of tf
-
Thanks for your code, but I want to learn how to use your modle to train a new set of data, so can you provide a train.py file?
-
Hello when I try the code on LJSpeech dataset everything works fine and I am able to train the model, but when I try my own dataset in language different than English, I am able to generate latents wi…
-
are there any detailed informations to all the parameters in the config files and how they affect the audio?
```
conf/mlfb_vqvae.yml
cobf/mflb_vqvae.yml
```
I left it all on default and trained 2…
-
我在运行你的测试代码的时候,出错了:
Network [ModulateGenerator] was created. Total number of parameters: 89.6 million. To see the architecture, do print(network).
Embedding size is 512, encoder SAP.
Network [ResSES…
-
is that able to train on Chinese dataset?
-
Thanks for your great work!
I want to train wavtokenizer with my own datasets in 16kHZ, but encounter tensor shape incosistent in the following code
```
periodicity_loss, pitch_loss, f1_score = ca…
-
Model is loaded successfully but getting error /bin/sh: ffmpeg: command not found please see below, help me.
Using cuda for inference.
Reading video frames...
Number of frames available for i…
-
[AyahaShirane](https://github.com/AyahaShirane)
专项训练参照这个视频MockingBird数据集制作教程-手把手教你克隆海子姐的声线_哔哩哔哩_bilibili
实测在已有模型基础上训练20K左右就能改变成想要的语音语调了。你如果是想要泛用型台湾口音的话,就尽可能收集更多人的数据集,否则会偏向特定某一个人的口音,而且断句和停顿似乎也会受到新数据集…
-
### Model/Pipeline/Scheduler description
TorToise is a multi-voice text-to-speech system, which describes a way to apply recent advances in the image generative domain to speech synthesis. It would…