Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.
2.38k
stars
254
forks
source link
Are you guys planning to implement GQA? #93
Closed
Taekyoon closed 1 year ago
I'm just curious about the development status because I was considering implementing GQA to train with llama2 34, 70b models.