keras-team / keras-nlp

Modular Natural Language Processing workflows with Keras
Apache License 2.0
738 stars 216 forks source link

Add `GPTNeoX` Model #1052

Open shivance opened 1 year ago

shivance commented 1 year ago

Pythia is a suite of 16 LLMs all trained on public data seen in the exact same order and ranging in size from 70M to 12B parameters. The model was developed with intention to facilitate research in many areas. That's why I think this would be a good addition to KerasNLP. I'll work on adding following as a part of Google Summer of Code

innat commented 1 year ago

https://github.com/keras-team/keras-nlp/issues/929