tatuylonen/wiktextract
Wiktionary dump file parser and multilingual data extractor
Other
799 stars, 82 forks
issues
[ja] extract translation sections and fix sound section code
#752
xxyzz
closed
2 months ago
0
[ja] extract sound sections
#751
xxyzz
closed
2 months ago
0
[ja] extract etymology sections and nested gloss lists
#750
xxyzz
closed
2 months ago
0
[ja] header line code changes
#749
xxyzz
closed
2 months ago
0
[ja] extract POS section header line nodes
#748
xxyzz
closed
2 months ago
0
[ja] extract example lists
#747
xxyzz
closed
2 months ago
0
[pl] break long function in "example.py" file and extract note sections
#746
xxyzz
closed
2 months ago
0
`colloquial` tag missing for German form `frägst`
#745
not-my-profile
closed
2 months ago
2
[pl] extract linkage sections and fix translation code
#744
xxyzz
closed
2 months ago
0
A few entries missing
#743
GrimPixel
closed
1 month ago
2
[pl] fix example section code and extract pronunciation section
#742
xxyzz
closed
2 months ago
0
[pl] extract grammatical tags and form-of data
#741
xxyzz
closed
2 months ago
0
[fr] fix etymology code for pages don't use italic POS node
#740
xxyzz
closed
2 months ago
0
[fr] find tables under level 3 title in "fr-conj-1-ier" template
#739
xxyzz
closed
2 months ago
0
[pl] extract etymology and translation sections
#738
xxyzz
closed
2 months ago
0
[pl] translate some tags and topics data start with "a" and "b"
#737
xxyzz
closed
2 months ago
0
[en] combine tags for pages have more than one head line
#736
xxyzz
closed
2 months ago
0
[en] simplify code removes parentheses text in `parse_word_head()`
#735
xxyzz
closed
2 months ago
0
[en] don't process soft redirect templates in `parse_language()`
#734
xxyzz
closed
2 months ago
0
[fr] fixes for extract conjugaison pages
#733
xxyzz
closed
2 months ago
0
Missing tags for German 'Butter'
#732
StefanVukovic99
closed
2 months ago
1
Part of the Chinese characters information off the line
#731
GrimPixel
closed
2 months ago
1
Canonical forms broken
#730
GrimPixel
closed
2 months ago
0
Tags in canonical form for German 'Herz'
#729
StefanVukovic99
closed
2 months ago
0
[pl] extract example section
#728
xxyzz
closed
2 months ago
0
[ja] add Japanese Wiktionary extractor
#727
xxyzz
closed
2 months ago
0
[pl] add Polish Wiktionary extractor
#726
xxyzz
closed
2 months ago
0
[fr] extract more "*-conj*" templates in "Conjugaison" pages
#725
xxyzz
closed
2 months ago
0
Inflection Tables Missing from Certain Entries In French Wiktionary
#724
ryellman
closed
2 months ago
11
[zh, es] combine example templates and add some POS titles
#723
xxyzz
closed
2 months ago
0
[es, zh] extract no list pages and rewrite zh thesaurus code
#722
xxyzz
closed
2 months ago
0
[de] extract more data from "Grammatische Merkmale" section
#721
xxyzz
closed
2 months ago
0
Handle `lit` arguments in example templates
#720
kristian-clausal
closed
2 months ago
0
[de] add examples that don't have matched gloss
#719
xxyzz
closed
2 months ago
0
[ru] changes for low quality pages
#718
xxyzz
closed
2 months ago
0
[ru] extract more low quality pages
#717
xxyzz
closed
2 months ago
0
[zh] extract data from low quality German pages
#716
xxyzz
closed
2 months ago
2
In examples, "taxonomic" is the same as "english"
#715
kristian-clausal
closed
2 months ago
1
[fr] find etymology POS title text inside italic node
#714
xxyzz
closed
2 months ago
1
[fr] extract more linkage and pos section data
#713
xxyzz
closed
2 months ago
0
Remove taxonomy in classify_desc()
#712
kristian-clausal
closed
2 months ago
4
[fr] reduce empty gloss data
#711
xxyzz
closed
2 months ago
0
[fr] use the `default` keyword of `pydantic.Field()`
#710
xxyzz
closed
2 months ago
0
[en] add empty gloss and "no-gloss" tag for "zh-see" template
#709
xxyzz
closed
2 months ago
0
More cleaning
#708
kristian-clausal
closed
2 months ago
9
[zh] fix sound data for some pages and extract more "form-of" templates
#707
xxyzz
closed
3 months ago
0
[es] fix `ValueError` exception in page "metegol"
#706
xxyzz
closed
3 months ago
0
Add a comment of why ignore E501 rule in tests directory
#705
xxyzz
closed
3 months ago
0
[zh] category and sounds fixes
#704
xxyzz
closed
3 months ago
0
[fr] add category links in etymology section
#703
xxyzz
closed
3 months ago
0