dbpedia / jsonpedia-extractor

Fine grained massive extraction of Wiipedia content GSoC 2014 Project
6 stars 4 forks source link

Depluralize tokens #8

Closed gigaroby closed 10 years ago

gigaroby commented 10 years ago

Investigate how to de-pluralize tokens (es, lucene or custom java)

Do it on:

http://resources.mpi-inf.mpg.de/yago-naga/javatools/doc/javatools/parsers/PlingStemmer.html

gigaroby commented 10 years ago

Ok using k-stem stemmer kstem