PDF2JSON is a conversion library based on XPDF (3.02) that can be used for high-performance, page-by-page conversion of PDF files to JSON and XML. It also supports compressing data to minimize size. PDF2JSON is available for Windows, OSX and Linux. Please see https://flowpaper.com for more information.
Hello all,
I have found an odd scenario where tab characters in the PDF text are dropped from the output, with nothing emitted in their place. For example (-> marks a tab):
This->is->separated->by->tabs
results in
Thisisseparatedbytabs
I am happy to look at the code if someone might be able to point me at where the text stream is handled.
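To make the symptom concrete, here is a small Python sketch. It is not PDF2JSON code, and both function names are hypothetical. The first function mimics what I am seeing, and the second shows the output I would expect, assuming a tab should come through as a single space.

```python
def drop_tabs(text: str) -> str:
    """Mimic the observed output: tab characters vanish entirely."""
    return text.replace("\t", "")


def tabs_to_spaces(text: str) -> str:
    """Assumed desired output: each tab becomes a single space."""
    return text.replace("\t", " ")


sample = "This\tis\tseparated\tby\ttabs"
print(drop_tabs(sample))       # what I get
print(tabs_to_spaces(sample))  # what I would expect
```

Whether a tab should map to one space or to a wider gap is an open question; the point is only that the word boundaries should survive.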