MorphDiv / TeDDi_sample

Text Data Diversity Sample (TeDDi Sample)
Other
5 stars 3 forks source link

Removed Gutenberg meta info and full license #267

Closed olgapelloni closed 2 years ago

olgapelloni commented 2 years ago

Cleaning Gutenberg texts from meta info (phrases like "START GUTENBERG PROJECT") and full license text. Here I push the changes in the English texts. This branch is still in progress, texts in 4 more languages are to be cleaned.