softmax1 / nanoGPT_softmax1

An experiment using nanoGPT vs nanoGPT (softmax1) to see how it affects perplexity score
0 stars 0 forks source link