EleutherAI/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
https://www.eleuther.ai/
Apache License 2.0
fix gpt-j residual bias assumption #1278
Closed
dmahan93 closed 2 months ago