knmnyn / ParsCit

An open-source CRF Reference String Parsing Package
http://wing.comp.nus.edu.sg/parsCit
GNU Lesser General Public License v3.0
155 stars 47 forks source link

Error on files with white spaces #19

Closed pchaigno closed 5 years ago

pchaigno commented 8 years ago

citeExtract.pl fails on files with white spaces. For instance, ./citeExtract.pl -m extract_all /pdfs/test\ 2.txt throws the following error:

mv: target '2.txt' is not a directory
Can't use string ("Could not open text file /pdfs/t"...) as an ARRAY ref while "strict refs" in use at /ParsCit/bin/../lib/ParsCit/PreProcess.pm line 340.

It works fine if the file is named test2.txt.

knmnyn commented 8 years ago

Thanks for the comment. We'll be fixing this soon.

cmkumar87 commented 5 years ago

We aren't developing parscit anymore. Please use neural version of the parser here. https://github.com/WING-NUS/Neural-ParsCit. This is a theano based version. We are actively developing a pytorch based version which we will release soon.