Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. For data extraction, bulk syntax checking, error detection, and offline formatting.
Link parsing: more broken link logic #272
Closed
kristian-clausal closed 5 months ago
If a link is in the form
[[Something#other|]]
it is rendered as plain text; the link is broken, so we just ignore it.

Links that are not broken:
[[Something|]]
without # in the first argument
[[#Other||]]
with another pipe; everything after the first pipe is part of the second argument, the link text. It is rendered as "|"...
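
A minimal sketch of the rule described above, assuming a link is passed in as a complete `[[...]]` string. The helpers `split_link` and `is_broken_link` are hypothetical names used only for illustration; they are not part of this package's API, and the sketch covers only the three cases listed here.

```python
from typing import Optional, Tuple


def split_link(wikilink: str) -> Tuple[str, Optional[str]]:
    """Split '[[target|text]]' on the FIRST pipe only (hypothetical helper).

    Returns (target, text); text is None when the link has no pipe at all.
    """
    if not (wikilink.startswith("[[") and wikilink.endswith("]]")):
        raise ValueError("expected a string of the form [[...]]")
    inner = wikilink[2:-2]
    # partition() splits at the first pipe, so any later pipes stay in the text.
    target, sep, text = inner.partition("|")
    return target, (text if sep else None)


def is_broken_link(wikilink: str) -> bool:
    """True for the broken form: '#' in the first argument and empty link text."""
    target, text = split_link(wikilink)
    return text == "" and "#" in target


if __name__ == "__main__":
    for example in ("[[Something#other|]]", "[[Something|]]", "[[#Other||]]"):
        print(example, split_link(example), is_broken_link(example))
```

Splitting on the first pipe only is what keeps `[[#Other||]]` out of the broken category: its second argument is the non-empty string `|`, not an empty link text.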