Closed LinB203 closed 8 months ago
Why are gradients disabled for the EMA model only when loading pre-trained weights?
https://github.com/willisma/SiT/blob/main/train.py#L165
DiT's EMA model never requires gradients.
https://github.com/facebookresearch/DiT/blob/main/train.py#L148
Hi,
Thank you for spotting the issue. The EMA model indeed never requires gradients. The code has been updated accordingly.
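For anyone landing here later, a minimal sketch of the intended pattern, modeled loosely on the helpers in DiT's train.py. The `nn.Linear` stand-in model, the SGD optimizer, and the decay value of 0.9 are assumptions for illustration only and are not taken from the SiT code:

```python
from copy import deepcopy

import torch
import torch.nn as nn


def requires_grad(model: nn.Module, flag: bool = True) -> None:
    """Set requires_grad on every parameter of the model."""
    for p in model.parameters():
        p.requires_grad = flag


@torch.no_grad()
def update_ema(ema_model: nn.Module, model: nn.Module, decay: float = 0.9999) -> None:
    """Move the EMA parameters a step toward the current model parameters."""
    ema_params = dict(ema_model.named_parameters())
    for name, param in model.named_parameters():
        ema_params[name].mul_(decay).add_(param.detach(), alpha=1 - decay)


model = nn.Linear(4, 2)      # hypothetical stand-in for the real network
ema = deepcopy(model)
requires_grad(ema, False)    # disabled unconditionally, not only after loading a checkpoint

opt = torch.optim.SGD(model.parameters(), lr=0.1)
loss = model(torch.randn(8, 4)).pow(2).mean()
opt.zero_grad()
loss.backward()
opt.step()
update_ema(ema, model, decay=0.9)  # EMA weights change only through this averaging step
```

The EMA copy is never passed to the optimizer and never receives gradients, so freezing it up front should be safe whether or not a checkpoint is loaded.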