ReaLLMASIC / nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.
MIT License
23 stars, 17 forks

Add dim to init functions for softmax variations #169

Closed: gkielian closed this 3 months ago