clulab / pdf2txt

Convert PDF files to TXT
Apache License 2.0
31 stars 5 forks source link

Added a fix on digits separated by newlines #16

Closed hubert10 closed 2 years ago

hubert10 commented 2 years ago

@kwalcock,

Restricting digits separator to find spaces or tabs, but not newlines fixes my problem. Could you review this PR?

I think I didn't understand well what StringUtils.escape(string) was doing but now I understand.

Thanks