BorealisAI / DT-Fixup

Optimizing Deeper Transformers on Small Datasets https://arxiv.org/abs/2012.15355
15 stars 10 forks source link

Update run.py #7

Closed premshanker-ai closed 1 year ago