Whether the longer music sample is the repetition of a shorted sample?

Each music sample corresponds to an image, so the length of the sample can be thought of as the x-resolution. This is limited by the GPU memory you have available. As it stands it is limite to short samples of a few seconds, but there are ways to stitch these samples together, which are explored in the sample notebooks. The idea of this repo was to show what could be done with a single commercial grade GPU: hopefully someone with access to much more compute power can do something more impressive.

teticio / audio-diffusion

Whether the longer music sample is the repetition of a shorted sample? #28