tarsqi / ttk

Tarsqi Toolkit
Apache License 2.0
25 stars 10 forks source link

Unable to open 'english-utf8.par' #89

Closed namrathaurs closed 5 years ago

namrathaurs commented 5 years ago

After setting up the Tarsqi toolkit and its associated components, the following error is being encountered when tarsqi.py script is executed on the sample XML input

$python tarsqi.py data/in/simple-xml/tiny.xml out.xml reading parameters ... ERROR: Can't open for reading: /tree_tagger/lib/english-utf8.par aborted

I made sure to follow all the steps outlined as part of the installation procedure. Is there something that I may have possibly missed? Any help will be greatly appreciated!

Thanks, Namratha

namrathaurs commented 5 years ago

UPDATE: This issue occurs due to a missing file english-utf8.par in the lib folder of the IMS Tree Tagger. Upon further inspection of the tagger parameter files for English, the file (trained on Penn treebank) is named english.par. Renamed this to english-utf8.par and the tarsqi.py script executes without any errors.