Closed sandorfoldi closed 1 year ago
I tested this on my local machine and on voltash, and it throws no errors. I also tried submitting it as a job; so far I have received no notifications.
@panosapos how will this extraction of audio lengths work when we have a mixture of datasets? Have you discussed that?
Do you think the solution you suggested yesterday would not work in this case?
No no, it will. I just want to be reassured that you guys talk to each other :)
During preprocessing, spectrograms are now saved as torch tensors. The lengths of the audio files are also saved to a CSV file.
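For illustration, here is a minimal sketch of how the length-extraction step could look. It assumes WAV input and a two-column CSV layout; the function name `write_audio_lengths` and the column names `path` and `length_s` are hypothetical, and the actual preprocessing code may use a different audio loader and layout.

```python
import csv
import wave
from pathlib import Path


def write_audio_lengths(audio_dir, csv_path):
    """Write one (path, length in seconds) row per WAV file under audio_dir."""
    rows = []
    for wav_path in sorted(Path(audio_dir).rglob("*.wav")):
        with wave.open(str(wav_path), "rb") as wav:
            # Duration = number of frames / sampling rate.
            length_s = wav.getnframes() / wav.getframerate()
        rows.append((str(wav_path), length_s))

    with open(csv_path, "w", newline="") as f:
        writer = csv.writer(f)
        writer.writerow(["path", "length_s"])  # hypothetical column names
        writer.writerows(rows)
    return rows
```

Writing the lengths once during preprocessing means later stages can read them from the CSV without reopening any audio file.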
The following two collators are implemented and can be selected from cfg:
The audio_lengths CSV file is used when collator == DeleteShorts: only the paths of audio files that are long enough are stored in the dataset, so the batch size stays constant.
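A rough sketch of that filtering step, under stated assumptions: the CSV has a header row with hypothetical columns `path` and `length_s`, and `load_long_enough_paths` and `min_length_s` are illustrative names, not the ones used in this PR.

```python
import csv


def load_long_enough_paths(csv_path, min_length_s):
    """Return the paths whose recorded length is at least min_length_s."""
    kept = []
    with open(csv_path, newline="") as f:
        for row in csv.DictReader(f):  # assumes a header row: path,length_s
            if float(row["length_s"]) >= min_length_s:
                kept.append(row["path"])
    return kept
```

Filtering when the dataset is built, rather than inside the collator, means no sample has to be discarded at batch time, which is one way to keep every batch at the configured size.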