Noahs-ARK / semafor

http://www.ark.cs.cmu.edu/SEMAFOR
GNU General Public License v3.0

JVM-only full pipeline (raw text to SEMAFOR-parsed json or xml) #14

Open sammthomson opened 9 years ago

sammthomson commented 9 years ago

This is mostly done in the develop and concrete branches; it just needs to be cleaned up and merged into master. Use Stanford CoreNLP for sentence splitting and tokenization, Morpha for lemmatization, and Malt for dependency parses.
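
For illustration only, here is a rough sketch of how those preprocessing stages might be chained on the JVM. Every name in it (PipelineSketch, Token, the four stage interfaces, the TOY_* stand-ins) is hypothetical and is not taken from the develop or concrete branches. It does not call CoreNLP, Morpha, or Malt; the toy stand-ins only mark where those tools would plug in.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class PipelineSketch {
    /** One token plus the annotations that later stages fill in. */
    static final class Token {
        final String form;
        String lemma;    // set by the lemmatizer stage
        int head = -1;   // 1-based index of the head token, 0 = root, -1 = not yet parsed
        Token(String form) { this.form = form; }
    }

    // Hypothetical stage interfaces; real implementations would wrap
    // CoreNLP (splitting, tokenization), Morpha (lemmas), and Malt (parses).
    interface SentenceSplitter { List<String> split(String rawText); }
    interface Tokenizer { List<Token> tokenize(String sentence); }
    interface Lemmatizer { void lemmatize(List<Token> sentence); }
    interface DependencyParser { void parse(List<Token> sentence); }

    /** Runs the four stages in order and returns one token list per sentence. */
    static List<List<Token>> preprocess(String rawText, SentenceSplitter splitter,
            Tokenizer tokenizer, Lemmatizer lemmatizer, DependencyParser parser) {
        List<List<Token>> sentences = new ArrayList<>();
        for (String sentence : splitter.split(rawText)) {
            List<Token> tokens = tokenizer.tokenize(sentence);
            lemmatizer.lemmatize(tokens);
            parser.parse(tokens);
            sentences.add(tokens);
        }
        return sentences;
    }

    // Toy stand-ins (assumptions, not the real tools).
    // Splits after sentence-final punctuation followed by whitespace.
    static final SentenceSplitter TOY_SPLITTER =
        text -> Arrays.asList(text.trim().split("(?<=[.!?])\\s+"));

    // Splits on whitespace only; punctuation stays attached to words.
    static final Tokenizer TOY_TOKENIZER = sentence -> {
        List<Token> tokens = new ArrayList<>();
        for (String form : sentence.trim().split("\\s+")) {
            tokens.add(new Token(form));
        }
        return tokens;
    };

    // Lowercases the surface form instead of doing real lemmatization.
    static final Lemmatizer TOY_LEMMATIZER = tokens -> {
        for (Token t : tokens) {
            t.lemma = t.form.toLowerCase();
        }
    };

    // Attaches each token to the one before it; the first token is the root.
    static final DependencyParser TOY_PARSER = tokens -> {
        for (int i = 0; i < tokens.size(); i++) {
            tokens.get(i).head = i;
        }
    };

    public static void main(String[] args) {
        List<List<Token>> parsed = preprocess("Dogs bark. Cats meow.",
            TOY_SPLITTER, TOY_TOKENIZER, TOY_LEMMATIZER, TOY_PARSER);
        for (List<Token> sentence : parsed) {
            for (Token t : sentence) {
                System.out.println(t.form + "\t" + t.lemma + "\t" + t.head);
            }
            System.out.println();
        }
    }
}
```

Keeping each stage behind its own small interface would let any one tool be swapped out without touching the rest of the pipeline, which is one possible way to structure the cleanup described above.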