bigscience-workshop / data_tooling

Tools for managing datasets for governance and training.
Apache License 2.0
74 stars 48 forks source link

Pseudo crawl dataset creation #383

Closed thomasw21 closed 2 years ago