trinker / textstem

Tools for fast text stemming & lemmatization
44 stars 8 forks source link

koRpus tree tagger broken #4

Closed trinker closed 7 years ago

trinker commented 7 years ago

I am a long time user of the koRpus package and on a Windows machine. I recently upgraded my version of koRpus and when I run:

koRpus::treetag(c("run", "ran", "running"), treetagger="manual", format="obj",
                      TT.tknz=FALSE , lang="en",
                      TT.options=list(path="C:/TreeTagger", preset="en"))

I get the error:

 Error: Specified file cannot be found:
 C:/TreeTagger/cmd/utf8-tokenize.pl 

Switching to the version 0.06-5 makes this go away. The reason is on a Windows machine there is no cmd/utf8-tokenize.pl it's called cmd/utf8-tokenize.perl.

The package textstem that I maintain depends on koRpus for this functionality thus it concerns me that other Windows users are unable to utilize this functionality.

I am using the most current treetagger version available from here: http://www.cis.uni-muenchen.de/~schmid/tools/TreeTagger/

Thank you for attention to this.

trinker commented 7 years ago

non-issue fixed in v. 0.10-2