togethercomputer / RedPajama-Data

The RedPajama-Data repository contains code for preparing large datasets for training large language models.
Apache License 2.0
4.43k stars 335 forks source link

fixes minor typo in data prep README #73

Open jspeis opened 9 months ago

jspeis commented 9 months ago

Just a very simple typo correction