Closed guillaumeguy closed 2 weeks ago
@guillaumeguy ah yes, that shouldn't be there
could you try 1.35.2?
Works great! Thank you for the quick turnaround!
@guillaumeguy you are one of the few using a full encoder decoder transformer
how are you using it?
It seems that parameter
encoder.to_logits.weight
in an Encoder/Decoder is not part of the forward pass. This creates problems when running DDP.Should these weights be removed in such case?
Reproducible example: