Closed xavriley closed 9 months ago
I've solved this after a good night's sleep 😅
In my case this was a data issue. I was using a chunk size of 10 seconds and in most of my training data there were long notes being held toward the end of the piece. The notesStrictlyContained
setting meant that, in some cases, the note was removed leaving lots of frame activity with no note associated which caused the gradients to blow up.
The fix in my case was to take 15 seconds off the duration
value when building the dataset which avoids these edge cases in my data. Leaving this here in case it helps others.
Hi,
Thank you very much for this repo - I'm trying to train this model from scratch on some Saxophone recordings.
Firstly, I was getting weird errors for
It might be worth mentioning these in the README for people who want to train on something other than Maestro.
The error I'm now encountering is during the first epoch
It looks like many of the parameters don't have their gradients initialised. This is strange because at this point in the run it has completed a backward pass so I thought all the gradients should have been set. I'm using the following settings to train:
Can you give me any tips on what to try next?