ReaLLMASIC / nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.
MIT License

Add scripts creating compatibility for additional dataset #187

Closed: klei22 closed this 2 months ago

klei22 commented 2 months ago

(This branches from the Vizier PR)

Adds scripts for compatibility with python-codes-25k, an instruction-following Python dataset released under the MIT license.
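
For readers unfamiliar with this kind of compatibility script, here is a minimal sketch of how instruction-following records might be flattened into a single plain-text training corpus. It is not the code from this PR. The field names (`instruction`, `input`, `output`), the prompt headers, the `<|endoftext|>` separator, and the helper names `format_record` and `records_to_text` are all assumptions made for illustration.

```python
from typing import Iterable, Mapping


def format_record(record: Mapping[str, str]) -> str:
    """Turn one instruction-following record into a text block.

    The keys "instruction", "input", and "output" are assumed here;
    the real dataset schema may differ.
    """
    instruction = record.get("instruction", "").strip()
    extra_input = record.get("input", "").strip()
    output = record.get("output", "").strip()

    parts = [f"### Instruction:\n{instruction}"]
    # Only emit the input section when the record actually has one.
    if extra_input:
        parts.append(f"### Input:\n{extra_input}")
    parts.append(f"### Response:\n{output}")
    return "\n\n".join(parts)


def records_to_text(
    records: Iterable[Mapping[str, str]],
    separator: str = "\n\n<|endoftext|>\n\n",  # hypothetical document separator
) -> str:
    """Join formatted records into one corpus string."""
    return separator.join(format_record(r) for r in records)


if __name__ == "__main__":
    # Hand-written sample records, not taken from the dataset.
    sample = [
        {"instruction": "Add two numbers", "input": "", "output": "print(1 + 2)"},
        {"instruction": "Reverse a string", "input": "abc", "output": "print('abc'[::-1])"},
    ]
    print(records_to_text(sample))
```

The resulting string could then be written to a text file and passed to whichever tokenization step the repository already uses for its other datasets.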