biosemantics / etc-site-archived-do-not-use

Source code for the ETC Toolkit web application
http://etc.cs.umb.edu/etcsite/

File names with non-ASCII characters #655

Open hongcui opened 7 years ago

hongcui commented 7 years ago

If a file name contains a non-ASCII character, the Text Capture - Parse step reports the following error:

ERROR edu.arizona.biosemantics.semanticmarkup.markupelement.description.io.lib.MOXyBinderDescriptionReader:114 - Could not read input file /var/lib/etcsite/data/textCapture/charaparser/231/out/Isoëtes echinospora.xml
java.io.FileNotFoundException: /var/lib/etcsite/data/textCapture/charaparser/231/out/Isoëtes echinospora.xml (No such file or directory)

As a result, the description in that file is not parsed.
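The issue does not identify a root cause. Two common reasons for a `FileNotFoundException` on a name that looks correct are (a) the JVM encoding file names with a non-UTF-8 charset, and (b) the name on disk using a different Unicode normalization form (precomposed `ë` versus `e` plus a combining diaeresis) than the name the reader requests. The sketch below is a diagnostic aid written under those assumptions, not a confirmed fix. The class `FileNameDiagnostics` and its methods are hypothetical names and are not part of the ETC code base; `sun.jnu.encoding` is a JVM-specific property that may be absent on some runtimes.

```java
import java.nio.charset.StandardCharsets;
import java.text.Normalizer;

// Hypothetical helper for investigating non-ASCII file name mismatches.
class FileNameDiagnostics {

    // Precomposed form: "ë" is the single code point U+00EB.
    static String toNfc(String name) {
        return Normalizer.normalize(name, Normalizer.Form.NFC);
    }

    // Decomposed form: "ë" becomes "e" followed by U+0308.
    static String toNfd(String name) {
        return Normalizer.normalize(name, Normalizer.Form.NFD);
    }

    // True if every character in the name is 7-bit ASCII.
    static boolean isAscii(String name) {
        return name.chars().allMatch(c -> c < 128);
    }

    // Hex dump of the UTF-8 bytes, to compare with what is stored on disk.
    static String utf8Hex(String name) {
        StringBuilder sb = new StringBuilder();
        for (byte b : name.getBytes(StandardCharsets.UTF_8)) {
            sb.append(String.format("%02x", b & 0xff));
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // File name taken from the error message in this issue.
        String name = "Iso\u00ebtes echinospora.xml";

        // Charsets the JVM uses for file content and (on many JVMs) file names.
        System.out.println("file.encoding    = " + System.getProperty("file.encoding"));
        System.out.println("sun.jnu.encoding = " + System.getProperty("sun.jnu.encoding"));

        System.out.println("ASCII only: " + isAscii(name));
        System.out.println("NFC bytes:  " + utf8Hex(toNfc(name)));
        System.out.println("NFD bytes:  " + utf8Hex(toNfd(name)));
    }
}
```

If the reported encodings are not UTF-8, or the bytes of the name on disk (for example as shown by `ls | xxd`) match only one of the two normalized forms, that would point to where the written and requested names diverge.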