HeidelTime / heideltime

A multilingual, cross-domain temporal tagger developed at the Database Systems Research Group at Heidelberg University.
GNU General Public License v3.0
342 stars 67 forks source link

missing file for Portuguese #63

Closed arademaker closed 6 years ago

arademaker commented 6 years ago

I got the error

$ java -jar de.unihd.dbs.heideltime.standalone.jar tmp/CF1.raw -t news -l portuguese
[TreeTaggerWrapper] File missing to use TreeTagger tokenizer: portuguese.par
[TreeTaggerWrapper] Cannot find tree tagger (/Users/arademaker/Downloads/heideltime-standalone/tagger/cmd/utf8-tokenize.perl). Make sure that path to tree tagger is set correctly in config.props!
[TreeTaggerWrapper] If path is set correctly:
[TreeTaggerWrapper] Maybe you need to download the TreeTagger tagger-scripts.tar.gz
[TreeTaggerWrapper] from http://www.cis.uni-muenchen.de/~schmid/tools/TreeTagger/data/tagger-scripts.tar.gz
[TreeTaggerWrapper] Extract this file and copy the missing file into the corresponding TreeTagger directories.
[TreeTaggerWrapper] If missing, copy portuguese-abbreviations-utf8 into /Users/arademaker/Downloads/heideltime-standalone/tagger/lib
[TreeTaggerWrapper] If missing, copy portuguese.par into /Users/arademaker/Downloads/heideltime-standalone/tagger/lib
[TreeTaggerWrapper] If missing, copy utf8-tokenize.perl into /Users/arademaker/Downloads/heideltime-standalone/tagger/cmd

It seems the system is missing the file portuguese.par but this file is not in the tagger-scripts.tar.gz package. Any idea?

BTW, the second line of the error message is strange, I have edited the config file and I setup the path to the tagger properly.

arademaker commented 6 years ago

solved, I downloaded the file from the TreeTagger website.