Closed AWAS666 closed 10 months ago
You also can provide updated config to run this on actual vctk dataset
You also can provide updated config to run this on actual vctk dataset
oh it's just adding in the new file paths, depending on whet ever these should be the main files or not it can be added, otherwise just rename them to replace the current
Thanks for the updated filelist!
Thanks for the updated filelist!
should we update the original files, since these are currently new ones?
We can keep both versions and add a new config file for vctk_new ? Thoughts?
What if we add a function in dataloader that skips missing files and stores them in a text file under the logs folder of the training for the user to later verify that? If number of files available is less than ~30, we can assert error.
It could be just a warning to say like 'Looks like you are using old version of dataset'
It could be just a warning to say like 'Looks like you are using old version of dataset'
Good practice is to generalize your code for future versions. But for now a simple warning like this could work too.
I mean the current is perfectly fine for anyone using their own dataset, it's just about the vctk versioning. I'd do just the new config for now and maybe add a flag to ignore the error altogether?
Cool. Did anyone train back yet? How are the results?
I've reset after some first trial and error runs to see what dataset works. Currently sitting at 124 epochs and the sample in tensorboard sounds decent.
Though I cant do any inference as I've moved to windows and I got no espeak there :(
I've reset after some first trial and error runs to see what dataset works. Currently sitting at 124 epochs and the sample in tensorboard sounds decent.
Though I cant do any inference as I've moved to windows and I got no espeak there :(
Thanks for your feedback! Good to see something positive about multispeaker training. You can always infer on Google Collab for free, temporarily.
changes I made to the original file list to be compatible with the newest vctk 0.92 version, not sure if those should be kept seperate or merged with the original ones.
tldr: I made a little script that deleted any entries which have no audio files