jasonppy / VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild
Other
7.53k stars 740 forks source link

Finetune dataset preparation #106

Open rikabi89 opened 5 months ago

rikabi89 commented 5 months ago

So I am looking to finetune probably a single speaker and small dataset and so far I followed your instructions to :

image

Which has downloaded and extracted the relveant files from the dataset.

image

But I am bit confused or lack the knowledge as to how I prepare the dataset to finetune my own dataset. I am only looking to finetune on top of your pretrained model.

rikabi89 commented 5 months ago

@jasonppy Do you not have any plans to do this? Like I streamlined process for finetuning?

I managed to put some scripts together (with AI)to create a dataset but having issues with skipping text emedding. Mainly I am not sure how this is achieved.