pgcorpus / gutenberg

Pipeline to generate the Standardized Project Gutenberg Corpus
https://zenodo.org/record/2422561
GNU General Public License v3.0
158 stars 38 forks source link