issues
search
dnmilne
/
wikipediaminer
An open source toolkit for mining Wikipedia
130
stars
62
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Update WEnvironment.java
#38
ajoorabchi
opened
6 years ago
0
Update languages.xml
#37
ajoorabchi
opened
6 years ago
0
Doc Link Broken
#36
megh1241
opened
7 years ago
0
Can not create article sets.
#35
QuytNguyen
opened
7 years ago
1
"could not identified root directory" when extract dump Vietnamese
#34
QuytNguyen
closed
7 years ago
1
Run the Dump Extractor fail
#33
QuytNguyen
closed
7 years ago
2
Model for Vietnamese?
#32
QuytNguyen
opened
7 years ago
0
Wikpedia Miner as web service slowness
#31
baderex
opened
8 years ago
1
exception from DumpExtractor with Russian or Dutch articles dump
#30
expert-fb
opened
8 years ago
0
enwiki-20110722-pages-articles.xml.bz2 is broken
#29
ali3assi
opened
8 years ago
4
ant build-database fails
#28
vshvedov
opened
9 years ago
6
Dump extractor failing on simple english
#27
rom1504
closed
9 years ago
5
Questions regarding the API
#26
shatu
opened
9 years ago
0
Sharing Wikipedia dump as well as csv extraction file
#25
xiaohan2012
opened
10 years ago
11
Installing wikipedia-miner 1.1
#24
xiaohan2012
closed
10 years ago
1
Summary extraction documentation is missing and some recommendation for it
#23
xiaohan2012
closed
10 years ago
2
The Label compare model for English is broken
#22
hoangyenan
opened
10 years ago
2
Added Spanish models for the mavenized version + spanish text utils
#21
Neuw84
opened
10 years ago
0
avro error
#20
priyolahiri
opened
10 years ago
2
Installing Wikipedia Miner with Chinese(language) dump
#19
wsj14847
opened
10 years ago
3
Dead download links
#18
Hocdoc
opened
10 years ago
2
The topic title returned by annotator not correctly encoded.
#17
ktao
opened
10 years ago
2
Not identifying disambig pages
#16
dnmilne
closed
10 years ago
0
Not extracting isPrimary value for page labels
#15
dnmilne
closed
10 years ago
0
Lots of bloom filter false positives in labelOccurrence extraction step
#14
dnmilne
opened
10 years ago
0
Cache labels that occur as links many times, regardless of link probability
#13
dnmilne
opened
10 years ago
0
Rethink page link caching
#12
dnmilne
opened
10 years ago
4
Investigate distributed caches, like redis
#11
dnmilne
opened
10 years ago
0
Investigate more sophisticated caches, like EHCache or Guava Cache.
#10
dnmilne
opened
10 years ago
1
Rethink label caching
#9
dnmilne
opened
10 years ago
0
Build final summaries
#8
dnmilne
opened
10 years ago
0
Extraction of label occurances
#7
dnmilne
closed
10 years ago
0
Extraction of labels and senses
#6
dnmilne
closed
10 years ago
0
Extraction of page depths
#5
dnmilne
closed
10 years ago
2
Extraction of page summaries
#4
dnmilne
closed
10 years ago
0
Improved caching
#3
dnmilne
closed
10 years ago
0
Jetty Database Configuration
#2
Neuw84
opened
10 years ago
2
Small changes, Spanish models added
#1
Neuw84
closed
10 years ago
2