keras-team / keras-nlp

Modular Natural Language Processing workflows with Keras
Apache License 2.0
730 stars, 215 forks

Porting Gemma 2 transformers checkpoint #1678

Open ariG23498 opened 1 week ago

ariG23498 commented 1 week ago

Porting Gemma 2 transformers checkpoints to KerasNLP.

ariG23498 commented 1 day ago

@mattdangerw @grasskin this PR is ready for review!

Note: The KerasNLP Gemma 2 model currently works only on the JAX backend.
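
For anyone trying this out, here is a minimal sketch of selecting the JAX backend, assuming Keras 3's multi-backend setup, where the `KERAS_BACKEND` environment variable is read when `keras` is first imported. The helper name `select_keras_backend` is hypothetical and only for illustration; it is not part of this PR or of Keras.

```python
import os
import sys


def select_keras_backend(name: str = "jax") -> bool:
    """Set KERAS_BACKEND and report whether the setting can still take effect.

    Hypothetical helper: Keras reads KERAS_BACKEND at import time, so the
    variable only matters if `keras` has not been imported yet.
    """
    already_imported = "keras" in sys.modules
    os.environ["KERAS_BACKEND"] = name
    return not already_imported


took_effect = select_keras_backend("jax")
print("KERAS_BACKEND =", os.environ["KERAS_BACKEND"])
if not took_effect:
    print("keras was already imported; restart the process to switch backends.")

# From here on, `import keras` / `import keras_nlp` should pick up JAX,
# assuming Keras 3 and JAX are installed in the environment.
```

Setting the variable in the shell before launching Python should work just as well; the point is only that it has to happen before the first `import keras`.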

Also, thanks to the Hugging Face team (Matt et al.) for providing the compute to work on this model.