danieldjohnson / biaxial-rnn-music-composition

A recurrent neural network designed to generate classical music.
BSD 2-Clause "Simplified" License
1.91k stars 380 forks source link

Output tends to converge to being identical to one particular piece #26

Closed streamliner18 closed 7 years ago

streamliner18 commented 7 years ago

In training small batches of music (~30), I realized that the output of the network especially further in the training process, tend to directly plagiarize one of the input pieces. This issue is particularly apparent when training with small EDM segments, which are essentially repeating chords. Is there a way to modify the network such that this problem is avoided?

Just for reference, I upsized the network by 2x the original sizes in blind attempt to produce a better result. Not sure if that is a root cause to this issue.

danieldjohnson commented 7 years ago

That sounds like an overfitting problem: you likely don't have enough data for it to learn to generalize well and make novel output. There are a couple of things you can try:

Unfortunately there isn't any "magic bullet" that makes this problem go away for all datasets, so it might be good to just try a few of those and see if anything improves the results.