Possibly broken unet2+ - Githubissues

deepglugs commented 2 years ago

It seems that unet1 progresses quickly. On my dataset, within 10-20 epochs I can get pretty good results at 64px:

But when trying unet2, the results are poor even after more than 20-40 epochs:

Here's another example of unet2 of mel-spectograms after many many steps:

imagen_158_84_loss0 02229

Granted, it is possible that unet1 results also look similar, but it's hard to tell with the blur going on. I assume that's the result of the noising function? Is there a way to turn off the noising functionality during sample if stop_at_unet is specified at 1? That said, with the spectogram, unet1 produces very good blacks in the padding area, but unet2 can't get it right.

I know other's have voiced issues with unet2 as well. Here are my unet2 settings for reference:

unet2 = dict(
            dim=128,
            cond_dim=512,
            dim_mults=(1, 2, 3, 4),
            cond_images_channels=cond_images_channels,
            num_resnet_blocks=2,
            layer_attns=(False, False, False, True,
            layer_cross_attns=(True, True, True, True),
            # final_conv_kernel_size=1,
            memory_efficient=True
        )

Note: I've also tried with dim_mults (1, 2, 4, 6) and num_resnet_blocks=(2, 2, 4, 8) with similar results.

integer753 commented 2 years ago

I'm also having problems upscaling. I'm on version 0.25.4, i have a well trained unet1 that's giving good results. Training unet2 for 200k steps (batch size 64) gives me very noisy output: When i trained on version 0.8.8, i had excellent output and it was never noisy and ultimately had great results:

I'm trying the absolute latest version now, but it always takes a while before unet1 is trained so it will be a while.

My 0.25.4 config for unet2:

unet2 = Unet(
    dim = 80,
    cond_dim = 512,
    dim_mults = (1, 2, 4, 8),
    num_resnet_blocks = (2, 4, 8, 8),
    cond_images_channels = 3,
    layer_attns = (False, False, False, True),
    layer_cross_attns = (False, False, False, True),
    memory_efficient = True,
)

My config in 0.8.8 is exactly the same, except i was only using dim 64 there. (So even with such a low dim size, i had pretty decent results at the time)