Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. For data extraction, bulk syntax checking, error detection, and offline formatting.
Use bz2 Python library if `lbzcat` and `bzcat` are not installed #279
Closed
xxyzz closed 5 months ago
Resolves https://github.com/tatuylonen/wiktextract/issues/371, but slower than `lbzcat`.
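
For illustration, the fallback described in the title could look roughly like the sketch below. This is not the code from this pull request: the function names `find_decompressor` and `iter_bz2_chunks`, the `names` parameter, and the chunk size are all hypothetical. It assumes only the standard library (`shutil.which`, `subprocess.Popen`, `bz2.open`): an external `lbzcat` or `bzcat` is used when one is on `PATH`, and otherwise the dump is decompressed in-process with `bz2`.

```python
import bz2
import shutil
import subprocess
from typing import Iterator, Optional, Sequence


def find_decompressor(names: Sequence[str] = ("lbzcat", "bzcat")) -> Optional[str]:
    """Return the path of the first tool in `names` found on PATH, else None.

    Hypothetical helper, not part of the package's API.
    """
    for name in names:
        path = shutil.which(name)
        if path is not None:
            return path
    return None


def iter_bz2_chunks(
    path: str,
    chunk_size: int = 1 << 20,
    names: Sequence[str] = ("lbzcat", "bzcat"),
) -> Iterator[bytes]:
    """Yield decompressed chunks of a .bz2 file.

    Prefers an external decompressor and falls back to the bz2 module
    when none of `names` is installed.
    """
    tool = find_decompressor(names)
    if tool is None:
        # Pure-Python fallback: works everywhere, but single-threaded.
        with bz2.open(path, "rb") as f:
            while chunk := f.read(chunk_size):
                yield chunk
        return

    # External tool: stream its stdout instead of loading the whole dump.
    with subprocess.Popen([tool, path], stdout=subprocess.PIPE) as proc:
        assert proc.stdout is not None
        while chunk := proc.stdout.read(chunk_size):
            yield chunk
    if proc.returncode != 0:
        raise RuntimeError(f"{tool} exited with status {proc.returncode}")
```

Passing `names=()` forces the `bz2` path, which is handy for checking that both branches produce identical output. The speed difference noted above is expected, since `lbzcat` can decompress in parallel while the `bz2` module decompresses on a single thread.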