Open williamFalcon opened 6 years ago
C) yes, it is equal to that. E) they're probably trying to generate the entire song so they use a dummy input for the first time step in all rnn tiers. This also answers H. Every part of the actual song is in the target sequence. The very first frame for every tier is generated from dummy inputs. G) Because the last sample is generated from the sample level mlp
Again, great project guys, good implementation!
I had some inline questions from the dataloader file. Not understanding why it's doing the segmentation it's doing.
Thanks! @koz4k