issues
search
idio
/
json-wikipedia
Json Wikipedia, contains code to convert the Wikipedia xml dump into a json dump. Questions? https://gitter.im/idio-opensource/Lobby
17
stars
2
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Incorrect namespaces handling in parallel mode
#25
keynmol
closed
9 years ago
0
Build jars with custom names
#24
keynmol
closed
9 years ago
1
Garbage namespaces
#23
dav009
opened
9 years ago
0
Lots of Missing Annotations
#22
dav009
closed
9 years ago
1
Colon extraction - Language Namespaces
#21
dav009
closed
9 years ago
1
Fixing colon bug - Extracting weird sf and topic Ids
#20
dav009
closed
9 years ago
3
Fixing colon bug - Extracting weird sf and topic Ids
#19
dav009
closed
9 years ago
0
"Wikipedia:" namespace has wrong ids
#18
keynmol
closed
9 years ago
2
Missing annotations in images outside of galleries
#17
keynmol
opened
9 years ago
0
Articles failing with ArrayIndexOutOfBoundsException
#16
keynmol
closed
9 years ago
1
Handle empty <text ...> tag correctly
#15
keynmol
closed
9 years ago
0
Incorrect handling of empty text
#14
keynmol
closed
9 years ago
1
Split invalid gallery paragraphs correctly.
#13
keynmol
closed
9 years ago
1
Exception in convertGalleriesToImages
#12
keynmol
closed
9 years ago
1
Removing lib folder
#11
dav009
closed
9 years ago
2
Merging JWPL code
#10
dav009
closed
9 years ago
4
Updating JWPL Dependencies - Colon URIs
#9
dav009
closed
9 years ago
3
Fix - Adding annotations from Tables & Lists
#8
dav009
closed
9 years ago
1
Annotations containing `:`
#7
dav009
closed
9 years ago
5
symbol in text/annotations
#6
keynmol
closed
5 years ago
3
Feature/dp fixing parallel processing
#5
dav009
closed
9 years ago
6
Adding new JWPL parser version
#4
dav009
closed
9 years ago
0
Speeding up - Using Spark
#3
dav009
closed
9 years ago
10
Feature - 2nd Case - Filtering empty WikiIds
#2
dav009
closed
9 years ago
2
Feature - Handling empty anchors
#1
dav009
closed
9 years ago
2
Previous