Closed ArnolFokam closed 3 years ago
In addition to providing these benchmark results,
Besides that, these results were obtained while training a transformer model for approximately 100 epochs.
Check out this pull request on
See visual diffs & provide feedback on Jupyter Notebooks.
Powered by ReviewNB
In addition to providing these benchmark results,
Besides that, these results were obtained while training a transformer model for approximately 100 epochs.