mel-gan Search Results - Githubissues

350 results
for mel-gan

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

henryliangt/tf-emotion #2

api changes

track change of tf

henryliangt updated 3 years ago
1
kaiidams/soundstream-pytorch #1

How to train a new set of data?

Thanks for your code, but I want to learn how to use your modle to train a new set of data, so can you provide a train.py file?

a897456 updated 6 months ago
79
tuanh123789/Train_Hifigan_XTTS #11

Custom dataset resulting in mel.shape[-1] * self.hop_len == …

Hello when I try the code on LJSpeech dataset everything works fine and I am able to train the model, but when I try my own dataset in language different than English, I am able to generate latents wi…

C00reNUT updated 1 month ago
1
k2kobayashi/crank #42

parameters in the configuration files/improving model

are there any detailed informations to all the parameters in the config files and how they affect the audio? ``` conf/mlfb_vqvae.yml cobf/mflb_vqvae.yml ``` I left it all on default and trained 2…

talka1 updated 2 years ago
4
Hangz-nju-cuhk/Talking-Face_PC-AVS #52

stack expects a non-empty TensorList

我在运行你的测试代码的时候，出错了： Network [ModulateGenerator] was created. Total number of parameters: 89.6 million. To see the architecture, do print(network). Embedding size is 512, encoder SAP. Network [ResSES…

TomatoBoy90 updated 1 year ago
2
jaywalnut310/vits #2

is that able to train on Chinese dataset?

is that able to train on Chinese dataset?

lucasjinreal updated 1 year ago
50
jishengpeng/WavTokenizer #19

encounter shape inconsistent in training 16kHz

Thanks for your great work! I want to train wavtokenizer with my own datasets in 16kHZ, but encounter tensor shape incosistent in the following code ``` periodicity_loss, pitch_loss, f1_score = ca…

dyyoungg updated 2 months ago
3
Rudrabha/Wav2Lip #490

/bin/sh: ffmpeg: command not found

Model is loaded successfully but getting error /bin/sh: ffmpeg: command not found please see below, help me. Using cuda for inference. Reading video frames... Number of frames available for i…

Sumit5194 updated 1 year ago
2
babysor/MockingBird #437

【长期】训练克隆特定人声音&finetune

[AyahaShirane](https://github.com/AyahaShirane) 专项训练参照这个视频MockingBird数据集制作教程-手把手教你克隆海子姐的声线_哔哩哔哩_bilibili 实测在已有模型基础上训练20K左右就能改变成想要的语音语调了。你如果是想要泛用型台湾口音的话，就尽可能收集更多人的数据集，否则会偏向特定某一个人的口音，而且断句和停顿似乎也会受到新数据集…

babysor updated 1 year ago
18
huggingface/diffusers #3891

Add Tortoise TTS as a pipeline

### Model/Pipeline/Scheduler description TorToise is a multi-voice text-to-speech system, which describes a way to apply recent advances in the image generative domain to speech synthesis. It would…

susnato updated 1 year ago
16

上一页 1...6 7 8 9 10 11 12...35 下一页

350 results for mel-gan

350 results
for mel-gan