issues
search
tatuylonen
/
wikitextprocessor
Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. For data extraction, bulk syntax checking, error detection, and offline formatting.
Other
89
stars
23
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Bump crate-ci/typos from 1.22.3 to 1.23.1
#295
dependabot[bot]
closed
1 day ago
1
Mypystuff
#294
kristian-clausal
closed
5 days ago
0
Set empty string as Lua title object `fragment` field default value
#293
xxyzz
closed
1 week ago
0
Use connection context manager in `get_entity_data()`
#292
xxyzz
closed
2 weeks ago
0
Only parse `----` as horizontal rule if it's at the start of line
#291
xxyzz
closed
2 weeks ago
1
Implement `#rel2abs` parser function
#290
xxyzz
closed
3 weeks ago
1
Bump crate-ci/typos from 1.21.0 to 1.22.3
#289
dependabot[bot]
closed
4 weeks ago
1
Update XML dump file namespace version
#288
xxyzz
closed
1 month ago
9
Remove upper case substitution modifiers
#287
xxyzz
closed
1 month ago
1
Update el edition namespace data
#286
xxyzz
closed
1 month ago
0
Change external links `[...]` regex
#285
kristian-clausal
closed
1 month ago
2
Update some namespace file and Scribunto git submodule
#284
xxyzz
closed
1 month ago
0
Update example usage code
#283
xxyzz
closed
2 months ago
0
assert error at src/parse.py ln 2287
#282
kylefoley76
closed
1 month ago
2
Don't add too many template args debug message for empty args
#281
xxyzz
closed
2 months ago
0
Bump crate-ci/typos from 1.20.4 to 1.21.0
#280
dependabot[bot]
closed
2 months ago
1
Use bz2 Python library if `lbzcat` and `bzcat` are not installed
#279
xxyzz
closed
2 months ago
0
Fix `dl` HTML tags can't have other HTML children bug
#278
xxyzz
closed
2 months ago
0
Remove the limit of unnamed template argument number
#277
xxyzz
closed
2 months ago
2
Unescape "*" to "*" in `mw.uri.anchorEncode()`
#276
xxyzz
closed
2 months ago
0
Add GH issue and Wiktionary links to `test_italics_in_table_header`
#275
xxyzz
closed
2 months ago
0
Fix an issue with TOKEN_RE
#274
kristian-clausal
closed
2 months ago
0
Create `Logger` object and set `Logging.DEBUG` level
#273
xxyzz
closed
2 months ago
5
Link parsing: more broken link logic
#272
kristian-clausal
closed
2 months ago
0
Run `mw.loadData()` in cloned environment
#271
xxyzz
closed
2 months ago
2
Remove unnecessary warning and logging setting code
#270
xxyzz
closed
2 months ago
3
Return empty links ("[[ ]]") as escaped text
#269
kristian-clausal
closed
2 months ago
0
Match quoted `begin` `end` attribute of the `section` tag for `#lst`
#268
xxyzz
closed
3 months ago
0
Change to link detection regex
#267
kristian-clausal
closed
3 months ago
1
Can't parse link nodes contain newline character
#266
xxyzz
closed
3 months ago
9
Bump crate-ci/typos from 1.19.0 to 1.20.4
#265
dependabot[bot]
closed
3 months ago
1
Implement "NUMBEROFPAGES" and "NUMBEROFARTICLES" magic words
#264
xxyzz
closed
3 months ago
0
Fix 'AttributeError' object has no attribute 'gsub' error in de edition
#263
xxyzz
closed
3 months ago
0
Fix the rest errors when expanding French Wikipedia's "liste des dirigeants successifs" template in page "Ford"
#262
xxyzz
closed
3 months ago
0
non-interpretation of certain {{...}} & [[...]]
#261
LeMoussel
closed
3 months ago
1
Implement `mw.wikibase.getEntity` Lua API
#260
xxyzz
closed
3 months ago
1
Install lupa package's wheel file
#259
xxyzz
closed
3 months ago
0
Undeclared variable assignments from modules without require ('strict');
#258
olsaarik
closed
2 weeks ago
2
Data dumps do not contain interwiki link (Wikidata) data
#257
kristian-clausal
closed
3 months ago
15
Implement mw.language:formatDate()
#256
kristian-clausal
closed
3 months ago
1
Remove `Wtp.file_aliases` and simplify the `Wtp.__init__()` function
#255
xxyzz
closed
3 months ago
11
Add `//` to allowed URL prefixes
#254
kristian-clausal
closed
4 months ago
0
Add `file_aliases` parameter to Wtp
#253
kristian-clausal
closed
4 months ago
1
Determine a HTML tag is self-closing if it ends with "/>"
#252
xxyzz
closed
4 months ago
2
Restore HTML named entity regex pattern
#251
xxyzz
closed
4 months ago
0
Fix misuse of `Match.start()` causes template expanded as text bug
#250
xxyzz
closed
4 months ago
2
Implements Day Name, Time and Hour
#249
LeMoussel
closed
4 months ago
3
Implement {{LOCALTIMESTAMP}} parserfns
#248
LeMoussel
closed
4 months ago
0
Fix HTML regex pattern and add a test for `mw.text.decode()`
#247
xxyzz
closed
4 months ago
1
Implement localdow parserfns
#246
LeMoussel
closed
4 months ago
1
Next