attardi / wikiextractor

A tool for extracting plain text from Wikipedia dumps
GNU Affero General Public License v3.0
3.76k stars 969 forks source link

Add options for a bare text format & removing empty documents #316

Open AngledLuffa opened 1 year ago