Closed thoraxe closed 1 year ago
Correct, the method uses an internal version that has been preprocessed for unit selection synthesis in the past in our institute.
The path to transcript dicts are the interface between the toolkit and the data, and since everyone likes to store their data in different ways, they are not generally applicable. The idea is, that if you want to train on some data, the path to transcript dict is the one thing that you have to set up yourself. You can use the path to transcript dict of the thorsten dataset as a template, I believe this one is formatted the same way as LJSpeech when it is downloaded and not further changed. Only the delimiter used in the transcription file might be different.
When you download the official LJSpeech dataset from https://keithito.com/LJ-Speech-Dataset/, you do not get any text files.
This dict builder expects a folder structure that does not exist in the original dataset as you can download it today.