Closed ramiKammoun closed 1 month ago
DATASET is helpful if you have prepared a manifest for multiple datasets in the same root directory; you can remove it or set it accordingly.
train,valid corresponds to your split file names e.g train.txt,valid.txt
88066 is the audio duration * 16000, you can obtain this with soundfile.read(audio_name)[0].shape[0]
label_dir is path to folder that has the .txt files
One last question, does the sample rate have to be 16000 when finetuning? Or could it be for 22050?
must be 16000
I wanted to inquire on the changes to make when trying to fine-tune ArTST for the task of TTS.
From what I understood, i need to change these in the finetune.sh file:
where should I be situated when changing the
/name/of/dataset
?For the
train
andvalid
, to what should they be changed?And finally, for the data_ROOT folder, the files existing in it, in the
test.tsv
file, this is an example of a line used in it:The number
88066
corresponds to what exactly?For the
LABEL_DIR
, what shall we put?