Open dbarroso1 opened 6 years ago
Hi @dbarroso1 , Did you eventually find how to format your data? I’m at the same stage. I couldn’t figure how to properly do it but duplicating the transcript.csv file from the LJ dataset and carefully pasting in my own dataset, sentence by sentence, did the trick. Not a particularly sustainable or elegant solution…
I am also facing this bucket error I checked the maxlen(151) and minlength (149) that's why in for loop there is no iteration , so there is no value in bucket . If anyone solved this problem kindly help me in solving this issue
Hello, ive been trying to make my own Training data, but there doesnt seem to be a ton of resources on how the data should be formatted. Ive compared the LJ001 Data and tried to imitate it, including splitting wavs, and the transcript.csv.
I have tested train.py with the LJ001 Data and the trainer works, but when i try with my Data it fails, giving me this error:
Here is an example of the CSV File, i tried matching the ID, TEXT, LENGTH Format.
So tldr two questions:
bucket_boundaries must not be empty
Error when python finds the CSV and can read it.