d40cht / Careers

Wikipedia-dump based natural language processing for named entity recognition
3 stars 1 forks source link

Fix run of pdf2text #7

Closed d40cht closed 13 years ago

d40cht commented 13 years ago

So:

a) It can cope with spaces in filenames b) If it fails, it doesn't upload the last CV text processed (so it deletes any existing tmp.txt before running).

d40cht commented 13 years ago

Fixed, including using withTemporaryDirectory.