Open platform-kit opened 10 months ago
Yeah I could add this, it was already in the web-ui repo. It was using fast_whisper to do it, I could add it.
That would be amazing!
Hey @Haurrus any progress on this?
I added this xtts_generate_dataset.py to create dataset, you can look into the readme :
Audio Dataset Preprocessing xtts_generate_dataset.py
Hi, this script looks very useful.
Does it automate dataset creation too? If not, I think that would be a great feature to add, so that one can simply point it at a directory of audio of various formats (mp3 / wav at least) and then chop them into clips, and generate the .CSV.