togethercomputer / RedPajama-Data

The RedPajama-Data repository contains code for preparing large datasets for training large language models.
Apache License 2.0
4.57k stars, 350 forks

Update README.md #117

Closed: mauriceweber closed this 3 months ago

mauriceweber commented 3 months ago

This PR fixes the link to the Wikipedia references classifier.