huggingface / transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
https://huggingface.co/transformers
Apache License 2.0
128.63k stars 25.51k forks source link

pointer to transformer (big) model #16747

Open anirudt opened 2 years ago

anirudt commented 2 years ago

🌟 New model addition

Model description

Hi, needed a pointer on how to instantiate a Transformer-big from the original Vaswani et. al. paper (Attention Is All You Need). I could only find versions of Transformer-like architectures, so would be useful if this could also be added.

Open source status

02shanks commented 1 year ago

@anirudt Can I work on this issue?

VarunGumma commented 1 year ago

Any leads on this?