huggingface / transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
https://huggingface.co/transformers
Apache License 2.0
128.75k stars, 25.54k forks

Bort (Amazon's reduced BERT) #8135

Closed raulcarlomagno closed 2 years ago

raulcarlomagno commented 3 years ago

🌟 New model addition

Model description

Amazon Alexa researchers extract an optimal subset of architectural parameters for the BERT architecture by applying recent breakthroughs in algorithms for neural architecture search. The proposed optimal subset, "Bort," is just 5.5 percent of the effective size of the original BERT-large architecture (not counting the embedding layer), and 16 percent of its net size.
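To make the two ratios concrete, here is a minimal sketch of how "effective size" (parameters excluding the embedding layer) and "net size" (all parameters) could be compared. The helper name `size_ratios` and the parameter counts in the usage are hypothetical placeholders chosen for illustration, not figures from the paper.

```python
def size_ratios(small_total, small_embed, large_total, large_embed):
    """Return (effective_ratio, net_ratio) of a small model relative to a large one.

    effective ratio: parameters excluding the embedding layer
    net ratio: all parameters, embeddings included
    """
    effective = (small_total - small_embed) / (large_total - large_embed)
    net = small_total / large_total
    return effective, net


# Hypothetical parameter counts (arbitrary units), not taken from the paper.
effective, net = size_ratios(
    small_total=50, small_embed=40, large_total=400, large_embed=100
)
print(f"effective: {effective:.1%}, net: {net:.1%}")
```

When the embedding layer makes up most of the smaller model, as in these made-up numbers, the effective ratio comes out well below the net ratio, which is the same direction as the 5.5 percent versus 16 percent gap quoted above.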

Open source status

Implemented using MXNet and GluonNLP.

Paper: https://arxiv.org/pdf/2010.10499.pdf

Repo: https://github.com/alexa/bort

yaysummeriscoming commented 3 years ago

Any update on this one?

LysandreJik commented 2 years ago

This was added in #9112