openai / grok

MIT License
4.06k stars 506 forks source link

OpenAI Grok Curve Experiments

Paper

This is the code for the paper Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets by Alethea Power, Yuri Burda, Harri Edwards, Igor Babuschkin, and Vedant Misra

Installation and Training

pip install -e .
./scripts/train.py