keras-team / keras-nlp

Modular Natural Language Processing workflows with Keras
Apache License 2.0
758 stars 227 forks source link

Support GPT-NeoX 20B (LLM) #929

Open pure-rgb opened 1 year ago

pure-rgb commented 1 year ago

GPT-NeoX

Official pytorch: https://github.com/EleutherAI/gpt-neox Unofficial jax: https://github.com/kingoflolz/mesh-transformer-jax

abheesht17 commented 1 year ago

+1

This is the open source version of GPT-3, right?

pure-rgb commented 1 year ago

yes, as far as I know. @abheesht17 wdyt? Is keras nlp ready to support this type of model? It's a very big model and unlikely to train for all.

pure-rgb commented 1 year ago

Minimal code pytorch and flax

Some reading.

People use jax/flax to leverage TPU: