I came across some weird PDF files for which pdffonts outputs invalid UTF-8 chars. This results in a "invalid UTF-8 ..." exception when matching NO_TEXT_DETECTED.
If Ruby 1.9/2.0 compatability is required, I can also extend this pull request with some scrub-polyfill.
I came across some weird PDF files for which pdffonts outputs invalid UTF-8 chars. This results in a "invalid UTF-8 ..." exception when matching NO_TEXT_DETECTED.
If Ruby 1.9/2.0 compatability is required, I can also extend this pull request with some scrub-polyfill.