Closed mahimairaja closed 2 months ago
The same error occurs with Chinese; the data preprocessing function doesn't seem to work with CJK characters.
Alright, is anyone already working on this issue?
This website is also owned by Microsoft. You can give it a try.
The error message AssertionError: [!] No training samples found in /tmp/xtts_ft/dataset//tmp/xtts_ft/dataset/metadata_train.csv appears because the dataset processing step didn't generate the dataset that the fine-tuning process (next tab) relies on.
After the dataset processing is done, your dataset directory should contain a wavs directory and two files, metadata_eval.csv and metadata_train.csv. The wavs directory contains the whole dataset divided into clips, and metadata_eval.csv and metadata_train.csv map these clips to their corresponding transcriptions (the example used Arabic voices).
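A quick way to confirm whether the dataset processing step actually produced anything is to inspect the directory before moving to the fine-tuning tab. The following is only a minimal sketch, not part of the project: `check_dataset` is a hypothetical helper, and it assumes the metadata files are `|`-delimited with a header row, which may not match your version of the demo.

```python
import csv
from pathlib import Path

# File names taken from the description above.
METADATA_FILES = ("metadata_train.csv", "metadata_eval.csv")


def check_dataset(root, delimiter="|"):
    """Return a list of human-readable problems found under `root`.

    Hypothetical helper: the `|` delimiter and the header row are
    assumptions about the metadata format, not confirmed by this thread.
    """
    root = Path(root)
    problems = []

    # The wavs directory should hold the audio clips.
    wavs = root / "wavs"
    if not wavs.is_dir() or not any(wavs.glob("*.wav")):
        problems.append(f"no .wav clips found in {wavs}")

    for name in METADATA_FILES:
        path = root / name
        if not path.is_file():
            problems.append(f"missing {path}")
            continue
        with path.open(newline="", encoding="utf-8") as f:
            rows = list(csv.reader(f, delimiter=delimiter))
        # Assumption: the first row is a header, the rest are samples.
        samples = [r for r in rows[1:] if any(cell.strip() for cell in r)]
        if not samples:
            problems.append(f"{path} contains no samples")
    return problems
```

For example, `check_dataset("/tmp/xtts_ft/dataset")` returns an empty list when the clips and both metadata files are present and non-empty. If it reports that a metadata file contains no samples, the transcription step likely produced no text for your audio.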
data processing
.ASR
process. Try a larger version of it.
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. You might also look at our discussion channels.
@jianchang512 @rose07 @zaher-m Can you provide code for fine-tuning XTTS-v2, please?
Describe the bug
It seems that there is a hidden issue in the dataset preparation for fine-tuning TTS on Japanese.
To Reproduce
Add a few Japanese speech audio samples to the dataset processing and click Create Dataset
Move to the fine-tuning tab and run the training
And the error message pops up:
Expected behavior
The fine-tuning process should run without interruption.
Logs
Environment
Additional context
No response