issues
search
tatuylonen
/
wiktextract
Wiktionary dump file parser and multilingual data extractor
Other
799
stars
82
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
[nl] add Dutch edition extractor
#852
xxyzz
closed
1 day ago
0
[simple] Add tests, find tags in glosses
#851
kristian-clausal
closed
1 day ago
0
[de] fix `AttributeError: 'int' object has no attribute 'lower'`
#850
xxyzz
closed
2 days ago
0
[fr] fix `TypeError: 'TemplateNode' object is not iterable`
#849
xxyzz
closed
2 days ago
0
[en] fix typo in test
#848
xxyzz
closed
2 days ago
0
[en] move analyze template code from wikitextprocessor
#847
xxyzz
closed
2 days ago
2
[simple] Parse templates at start of gloss as tags
#846
kristian-clausal
closed
3 days ago
0
[simple] Quick bug fix: 'index' key error in table parsing
#845
kristian-clausal
closed
3 days ago
0
Python 3.10
#844
kristian-clausal
closed
2 days ago
0
clean_value: Remove whitespace before Category links
#843
kristian-clausal
closed
3 days ago
1
[ko] extract etymology list
#842
xxyzz
closed
4 days ago
0
[ko] extract example lists
#841
xxyzz
closed
4 days ago
0
[ko] add Korean edition extractor
#840
xxyzz
closed
5 days ago
0
[ru] extract "омофоны"(homophone) sections
#839
xxyzz
closed
1 week ago
0
[ru] improve linkage section code
#838
xxyzz
closed
1 week ago
0
[fr] extract tables in all "Onglets conjugaison" template tabs
#837
xxyzz
closed
1 week ago
0
[es] extract category links under level 2 node and separate POS category links
#836
xxyzz
closed
1 week ago
0
[de] rename `Example`'s "raw_ref" field to "ref"
#835
xxyzz
closed
1 week ago
0
[fr] extract "lien" and "voir anagrammes" templates in linkage sections
#834
xxyzz
closed
1 week ago
0
[fr] fix link node translation data and extract "Taux de reconnaissance" section
#833
xxyzz
closed
1 week ago
0
[pl] fix check json warnings and move `notes` field
#832
xxyzz
closed
1 week ago
0
[zh] extract "tlb" template and translate some tags in "zh-pron" template
#831
xxyzz
closed
1 week ago
0
[ru] fix `AttributeError` exception in page "opgave" and "denaske"
#830
xxyzz
closed
1 week ago
0
[en] move some en edition code to "extractor/en" folder
#829
xxyzz
closed
1 week ago
3
[ru] extract category templates and "zh-forms" templates
#828
xxyzz
closed
2 weeks ago
0
[ru] extract more translation and example data, also clean up code
#827
xxyzz
closed
2 weeks ago
0
[zh] overwrite some title templates and fix extracted data in low quality pages
#826
xxyzz
closed
2 weeks ago
0
[fr] extract "zh-exemple" example template inside gloss list
#825
xxyzz
closed
2 weeks ago
0
Simple english
#824
kristian-clausal
closed
3 days ago
2
[en] extract "zh-x" example template in etymology section
#823
xxyzz
closed
2 weeks ago
1
[en] don't extract "span" tag in example source "dd" tags
#822
xxyzz
closed
2 weeks ago
0
[zh] improve extract gloss and example data code
#821
xxyzz
closed
2 weeks ago
0
Failing tests for reversing parse with clean_node
#820
kristian-clausal
closed
2 weeks ago
1
[pl] extract literal meaning in example translation
#819
xxyzz
closed
2 weeks ago
0
[ja] extract conjugation table templates use "日本語活用表" Lua module
#818
xxyzz
closed
3 weeks ago
0
[ja] extract accent data from pronunciation section templates
#817
xxyzz
closed
3 weeks ago
0
[pl] update translation test
#816
xxyzz
closed
3 weeks ago
0
[pl] extract "furi" Japanese templates in translation lists
#815
xxyzz
closed
3 weeks ago
0
[ja] improve extract forms and sounds data code
#814
xxyzz
closed
3 weeks ago
0
[zh] extract "Q" example template
#813
xxyzz
closed
3 weeks ago
0
[en] Remove some unused code from page.py
#812
kristian-clausal
closed
3 weeks ago
0
[en, zh] minor changes to extract example data code
#811
xxyzz
closed
3 weeks ago
0
[en] return `ExampleData` from `extract_template_ja_usex()`
#810
xxyzz
closed
3 weeks ago
0
[en, zh] improve extract example templates code
#809
xxyzz
closed
3 weeks ago
0
[en] extract "zh-x" example templates properly
#808
xxyzz
closed
3 weeks ago
3
[de] fix `IndexError` exception and add some title templates
#807
xxyzz
closed
3 weeks ago
0
[ja] improve linkage, etymology, example section code
#806
xxyzz
closed
4 weeks ago
0
[ja] fix `AttributeError` exceptions in `process_sound_template()`
#805
xxyzz
closed
1 month ago
0
[ru] add "save_ns_names" and "extract_ns_names" to `config.json`
#804
xxyzz
closed
1 month ago
0
[es] handle more than one words "Locuciones" section list format
#803
xxyzz
closed
1 month ago
0
Next