Hello!
I want to use MeloTTS to output in a voice from my customized dataset.
I'm fairly new to this, and I'm just mega confused about how I could use the checkpoints or pretrained models and finetune it on top of my own data.
Right now I just have this script which I just run with model.py.
import torch
from melo.api import TTS
# Speed is adjustable
speed = 1.0
device = 'cpu' # or cuda:0
text = "안녕하세요! 오늘은 날씨가 정말 좋네요."
checkpoint_path = '/home/tts/MeloTTS/melo/configs/checkpoint.pth'
config_path = '/home/tts/MeloTTS/melo/data/kor_config.json'
model = TTS(language='KR', device=device, config_path=config_path, ckpt_path=checkpoint_path)
speaker_ids = model.hps.data.spk2id
For the kor_config, I am using the korean config file, and in there I have set my code to
Hello! I want to use MeloTTS to output in a voice from my customized dataset. I'm fairly new to this, and I'm just mega confused about how I could use the checkpoints or pretrained models and finetune it on top of my own data.
Right now I just have this script which I just run with
model.py
.For the kor_config, I am using the korean config file, and in there I have set my code to
The checkpoint is the korean checkpoint to the korean checkpoint in
download_utils.py
.How can I now fine-tune the model on my own data?