tatuylonen / wikitextprocessor

Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. For data extraction, bulk syntax checking, error detection, and offline formatting.
Other
93 stars 23 forks source link

Add `file_aliases` parameter to Wtp #253

Closed kristian-clausal closed 6 months ago

kristian-clausal commented 6 months ago

Some wikiprojects have translated syntax for [[File / [[Image image/file display links, like French wikipedia Fichier.

In Wiktextract clean_value, links with File and Image are basically returned as empty strings, so we'll continue doing that for now; might change things for class=noviewer which inlines an image, and which might require handling of alt texts.

xxyzz commented 6 months ago

Namespace alias data contain this "Fichier" value, I should update the "get_namespaces.py" file and namespace json files.