LoicGrobol / zeldarose

Train transformer-based models.
https://zeldarose.readthedocs.io

Add SWA #60

Open LoicGrobol opened 1 year ago

LoicGrobol commented 1 year ago

See https://lightning.ai/docs/pytorch/stable/advanced/training_tricks.html#stochastic-weight-averaging
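
For context, the linked page documents Lightning's `StochasticWeightAveraging` callback. The sketch below is not zeldarose or Lightning code. It only illustrates, in plain Python, the equal-weight running average that SWA keeps over weight snapshots. The `swa_update` helper and the snapshot values are hypothetical and exist only for this illustration.

```python
def swa_update(averaged, current, n_averaged):
    """Fold `current` weights into a running equal-weight average.

    `n_averaged` is the number of snapshots already included in `averaged`.
    """
    return [a + (c - a) / (n_averaged + 1) for a, c in zip(averaged, current)]


# Hypothetical weight snapshots, e.g. taken at the end of three epochs.
snapshots = [[1.0, 4.0], [2.0, 6.0], [3.0, 8.0]]

# Start from the first snapshot, then fold in the later ones one at a time.
swa_weights = list(snapshots[0])
for n, weights in enumerate(snapshots[1:], start=1):
    swa_weights = swa_update(swa_weights, weights, n)

print(swa_weights)
```

After the loop, `swa_weights` holds the element-wise mean of all snapshots. An actual integration would presumably delegate this bookkeeping, along with the SWA learning-rate schedule, to the Lightning callback instead of reimplementing it.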