tatuylonen / wikitextprocessor

Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. For data extraction, bulk syntax checking, error detection, and offline formatting.
Other
93 stars 23 forks source link

Link parsing: more broken link logic #272

Closed kristian-clausal closed 5 months ago

kristian-clausal commented 5 months ago

If a link is in the form [[Something#other|]] it is rendered as text and is broken and we will just ignore it.

Links that are not broken:

[[Something|]] without # in the first arg

[[#Other||]] with another pipe; everything after the first pipe is part of the second argument for link text. It is rendered as "|"...