allenhung1025 / LoopTest

Official repo of ISMIR-21 publication, “A Benchmarking Initiative for Audio-domain Music Generation using the FreeSound Loop Dataset”.
MIT License

training four-bar models #3

Closed naotokui closed 2 years ago

naotokui commented 2 years ago

Thanks for sharing this great repo! One question I have is... how did you train the four-bar model? Every time I tried to train a four-bar model on four-bar-long loops, I ran into mode collapse. I tried smaller learning rates, but that didn't help either. Any tips?

allenhung1025 commented 2 years ago

Hi @naotokui, may I ask how much data you used for training? In my experience, I needed roughly 6000 four-bar loops to get good results.

naotokui commented 2 years ago

Sorry, I missed your comment. It turned out... the issue was the small difference between the shape of preprocessed data (80, 641) and the shape of the generated spectrogram (80, 640) in the case of my two-bar model. Thanks again for your feedback!
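
For anyone hitting the same problem, here is a minimal sketch of a guard against this kind of off-by-one frame mismatch. It assumes the preprocessed mel-spectrograms are NumPy arrays of shape (n_mels, frames). The names `EXPECTED_SHAPE`, `match_frames`, and `check_dataset_shapes` are hypothetical and are not part of the LoopTest code; the target shape (80, 640) is simply the one reported above for the two-bar model.

```python
import numpy as np

# Assumed target shape (mel bins, frames), taken from the two-bar case
# described in this thread. Adjust for other loop lengths.
EXPECTED_SHAPE = (80, 640)


def match_frames(mel: np.ndarray, n_frames: int) -> np.ndarray:
    """Crop a (n_mels, T) mel-spectrogram along time to exactly n_frames.

    Raises instead of padding when the input is too short, so that
    unexpectedly short clips are noticed rather than silently altered.
    """
    if mel.ndim != 2:
        raise ValueError(f"expected a 2-D array, got shape {mel.shape}")
    if mel.shape[1] < n_frames:
        raise ValueError(
            f"spectrogram has {mel.shape[1]} frames, need at least {n_frames}"
        )
    return mel[:, :n_frames]


def check_dataset_shapes(mels, expected=EXPECTED_SHAPE):
    """Return the indices of items whose shape differs from `expected`."""
    return [i for i, m in enumerate(mels) if m.shape != expected]


if __name__ == "__main__":
    # Dummy data standing in for one preprocessed loop with an extra frame.
    mel = np.random.rand(80, 641)
    fixed = match_frames(mel, EXPECTED_SHAPE[1])
    print(fixed.shape)
    print(check_dataset_shapes([mel, fixed]))
```

Running a check like `check_dataset_shapes` over the whole preprocessed set before training makes the mismatch visible up front, rather than surfacing later as collapsed or distorted output.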

littlebeanhp commented 1 year ago

> Sorry, I missed your comment. It turned out... the issue was the small difference between the shape of preprocessed data (80, 641) and the shape of the generated spectrogram (80, 640) in the case of my two-bar model. Thanks again for your feedback!

@naotokui Have you successfully trained the 4-bar model, and is this still the solution? I have trained the 4-bar model and it generates absolute nonsense... The mels are all distorted.