issues
search
ogrisel
/
pignlproc
Apache Pig utilities to build training corpora for machine learning / NLP out of public Wikipedia and DBpedia dumps.
158
stars
64
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Hadoop 2 and AWS EMR 5 compatibility
#15
jzonthemtn
opened
7 years ago
0
Fix link to blog post in README.
#14
miku
closed
9 years ago
1
Fixed issue #9. Just added missing parameters as ''
#13
alfonsonishikawa
opened
10 years ago
2
Fixed issue #9. Just added missing parameters as ''
#12
alfonsonishikawa
closed
10 years ago
2
Update AnnotatingMarkupParser.java
#11
alfonsonishikawa
closed
10 years ago
1
compatibility fix for Pig version 0.10.0
#10
maxjakob
closed
12 years ago
2
Could not instantiate 'pignlproc.storage.UriUriNTriplesLoader'
#9
frankscholten
closed
9 years ago
3
Small fix
#8
sfermigier
closed
12 years ago
1
Error Building a corpus from Italian Wikipedia
#7
raymanrt
opened
13 years ago
11
Invalid resource schema: bag schema must have tuple as its field
#6
raymanrt
opened
13 years ago
4
Wikilinks
#5
raymanrt
closed
13 years ago
2
NER corpus: add special treatment for first sentence of article
#4
ogrisel
opened
13 years ago
0
Resolve the redirects links from DBpedia
#3
ogrisel
closed
13 years ago
1
Update the EC2 wikipage to use Whirr 0.5-incubating
#2
ogrisel
opened
13 years ago
0
fix up pom.xml
#1
kryton
closed
13 years ago
3