ZackHodari / tts_data_tools

Data processing tools for preparing speech and labels for training TTS voices
24 stars 3 forks source link

Could you please offer some examples? #2

Closed Yablon closed 4 years ago

Yablon commented 4 years ago

Hello, could you please offer some exapmles, showing that how I can extract what from what ? Thank you !

ZackHodari commented 4 years ago

If you want to extract Merlin-equivalent labels take a look at the process_dataset.py script. It basically called lab_to_feat.py and world_with_reaper_f0.py, matches the length number of frames in the durations with the acoustic features, and trims silences (if requested).

The project is written with the intention that some people can use the provided scripts out of the box. But if you need more custom feature extraction the idea is that you can write your own script (e.g. extract phone identity).

Label creation scripts are found in lab_gen/ and have the following functionalities:

Waveform feature extraction scripts can be found in wav_gen/, the script world_with_reaper_f0.py does the following:

Take a look at the README for more details.

Yablon commented 4 years ago

@ZackHodari Thanks !