Residual connection after decoder attention?

alexdemartos commented 1 year ago

In the decode_from_codebook_indices method there is a residual connection after the decoder attention:

Which is not present in the forward method:

Is this correct?

In my case, I found significantly higher audio quality during inference after removing this residual connection (matching training conditions).

lucidrains commented 1 year ago

@alexdemartos oh gosh yes :man_facepalming: thank you for catching this

lucidrains commented 1 year ago

@alexdemartos does this mean you have already gotten to the stage of sampling from the coarse and fine transformers?

cyanbx commented 1 year ago

thanks for detecting this bug! It helps me a lot

lucidrains commented 1 year ago

ok, i'm guessing he's slinking back off into the darkness to pen his next paper :laughing:

i'll leave him be

lucidrains / audiolm-pytorch