Closed egorsmkv closed 2 years ago
Processor is highly related to speech corpus. So, in order to preserve generality, the project would implement the processors for those common datasets. A simple way is to make your dataset compatible with those common datasets, and you can directly make use of the existing processors.
I have seen code for processors and in my opinion it's easy to write own one.
Your advice about common format is right, fully support it.
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs.
The dataset is here - https://github.com/egorsmkv/ukrainian-tts-datasets/tree/main/lada
Hello.
If I will publish a high-quality dataset (around 8 hours of speech) for Ukrainian, then you can make a processor for it?