mlfoundations / scaling

Language models scale reliably with over-training and on downstream tasks
MIT License
93 stars, 5 forks