bigscience-workshop / catalogue_data

Scripts to prepare catalogue data
Apache License 2.0

Add script to generate the columns for deduplication and short filter document #44

Closed by thomasw21 2 years ago