DARIAH-DE / DARIAH-DKPro-Wrapper

Wrapper for DKPro Core to extract linguistic information from books.
http://dariah-de.github.io/DARIAH-DKPro-Wrapper
Apache License 2.0

Resume does not skip existing files with space characters in the filenames #26

Closed thvitt closed 7 years ago

thvitt commented 7 years ago

When an input file name contains a space character, e.g. Goethe,-Johann-Wolfgang_Wilhelm Meisters Lehrjahre.txt, each space is encoded as '%20' in the output file name: Goethe,-Johann-Wolfgang_Wilhelm%20Meisters%20Lehrjahre.txt.csv.

The automatic resume does not recognize that the encoded output name belongs to the original input file, so it reprocesses these files instead of skipping them.
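
To illustrate the mismatch, here is a minimal sketch of a resume check that applies the same encoding before looking for existing output. It is not the wrapper's actual code: the class `ResumeCheck` and both methods are hypothetical, and it assumes that only spaces are encoded (as '%20') and that '.csv' is appended to the full input name, which is all the example above shows.

```java
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

// Hypothetical helper, not part of DARIAH-DKPro-Wrapper.
final class ResumeCheck {

    // Assumption: the output name is the input name with each space
    // replaced by "%20" and ".csv" appended, as in the reported example.
    static String expectedOutputName(String inputName) {
        return inputName.replace(" ", "%20") + ".csv";
    }

    // Returns true if the encoded output file already exists in outputDir,
    // i.e. the input could be skipped when resuming.
    static boolean alreadyProcessed(Path inputFile, Path outputDir) {
        String inputName = inputFile.getFileName().toString();
        return Files.exists(outputDir.resolve(expectedOutputName(inputName)));
    }

    public static void main(String[] args) {
        String input = "Goethe,-Johann-Wolfgang_Wilhelm Meisters Lehrjahre.txt";
        System.out.println(expectedOutputName(input));
        System.out.println(alreadyProcessed(Paths.get(input), Paths.get("output")));
    }
}
```

A check that compares the raw input name against the output directory would look for a name containing literal spaces and never find the '%20' variant, which would match the behaviour described here.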