google / trax

Trax — Deep Learning with Clear Code and Speed
Apache License 2.0
8.07k stars, 813 forks

The evaluation notebook for the NeurIPS paper "Sparse is Enough in Scaling Transformers". #1709

Closed: copybara-service[bot] closed this 2 years ago

copybara-service[bot] commented 2 years ago

The evaluation notebook for the NeurIPS paper "Sparse is Enough in Scaling Transformers".