Open gangagyatso4364 opened 2 weeks ago
piper-phonemize==1.1.0, which is not available for macOS, leading to the error message stating that no matching distribution was found.
Dependency Issue: The piper-tts library requires piper-phonemize==1.1.0, but this version does not have a compatible wheel for macOS. Version Availability: The latest version of piper-phonemize available on PyPI is indeed 1.1.0, but it lacks macOS-compatible distributions, which is causing the installation failure
To go ahead with piper-phonemize-cross which has wheel for macos
talk with @10zinten about the data.distribution for the TTS that he used when trained TTS model.
Link to the dataset - https://huggingface.co/datasets/openpecha/tts-sherab
Dataset Format
The pre-processing script expects data to be a directory with:
metadata.csv
- CSV file with text, audio filenames, and speaker nameswav/
- directory with audio filesThe
metadata.csv
file uses|
as a delimiter, and has 2 or 3 columns depending on if the dataset has a single or multiple speakers. There is no header row.For single speaker datasets:
where
id
is the name of the WAV file in thewav
directory. For example, anid
of1234
means thatwav/1234.wav
should exist.For multi-speaker datasets:
where
speaker
is the name of the utterance's speaker. Speaker ids will automatically be assigned based on the number of utterances per speaker (speaker id 0 has the most utterances).Link to the language which is by default espeak-ng - https://github.com/espeak-ng/espeak-ng/blob/master/docs/languages.md
Sub Tasks: