Closed deccolquitt closed 2 years ago
@deccolquitt there has been obviously something wrong with your model : where did you get it?
@domkirke I trained it myself from scratch on paperspace using an a100
i have been able to use generation.py with the exported rave .ts file and it produced audio (although I could not with the exported prior .ts file), same applies to using reconstruct.py. The audio wasn't great and had that fuzzy ringing throughout it (which I associated with the early training stages, it was my understanding that things cleared up in the second (prior) stage of training)
Is the sampling rate of 1267073031
intentional ? Same question for the latent space 32682
?
@caillonantoine nope I didn't specify either of those as hyperparameters
@deccolquitt there has been a problem with your model. Did you follow the instruments of the RAVE/README.md correcty? Your sampling rate should match your audio files (44100 / 48000 for audio usually), and the number of dimensions should be around 128. I advise you to train the model again using the cli_helper.py
command line helper.
As per previous comments on solved issue #3, here is my error log when using a custom model in the standalone app