attardi / wikiextractor

A tool for extracting plain text from Wikipedia dumps
GNU Affero General Public License v3.0
3.69k stars 959 forks source link

Add options for a bare text format & removing empty documents #316

Open AngledLuffa opened 11 months ago