tatuylonen / wikitextprocessor

Python package for Wikimedia dump processing (Wiktionary, Wikipedia, etc.). Wikitext parsing, template expansion, Lua module execution. For data extraction, bulk syntax checking, error detection, and offline formatting.

Use bz2 Python library if `lbzcat` and `bzcat` are not installed #279

Closed by xxyzz 5 months ago

xxyzz commented 5 months ago

Resolves https://github.com/tatuylonen/wiktextract/issues/371. The `bz2` fallback is slower than `lbzcat`.
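
For readers who want the idea in code, here is a minimal sketch of this kind of fallback. It is not the code from this PR: the names `pick_decompressor` and `iter_dump_chunks` and the `tool` parameter are hypothetical, and it assumes the dump is a single `.bz2` file read in fixed-size chunks.

```python
import bz2
import shutil
import subprocess
from typing import Iterator, Optional


def pick_decompressor() -> Optional[str]:
    """Return the first external tool found on PATH, or None if neither exists."""
    for name in ("lbzcat", "bzcat"):
        if shutil.which(name) is not None:
            return name
    return None


def iter_dump_chunks(
    path: str, chunk_size: int = 1 << 20, tool: Optional[str] = "auto"
) -> Iterator[bytes]:
    """Yield decompressed chunks of a .bz2 file (hypothetical helper).

    tool="auto" probes PATH; tool=None forces the pure-Python bz2 module.
    """
    if tool == "auto":
        tool = pick_decompressor()

    if tool is None:
        # Fallback: the standard-library bz2 module, no external binary needed.
        with bz2.open(path, "rb") as f:
            while chunk := f.read(chunk_size):
                yield chunk
        return

    # External tool: stream its stdout instead of loading the whole dump.
    with subprocess.Popen([tool, path], stdout=subprocess.PIPE) as proc:
        assert proc.stdout is not None
        while chunk := proc.stdout.read(chunk_size):
            yield chunk
    if proc.returncode != 0:
        raise RuntimeError(f"{tool} exited with status {proc.returncode}")
```

Calling `iter_dump_chunks(path)` prefers `lbzcat`, then `bzcat`, and only then the `bz2` module, so a machine with the faster tools installed keeps using them.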