flexpaper / pdf2json

PDF2JSON is a conversion library based on XPDF (3.02) which can be used for high performance PDF page by page conversion to JSON and XML format. It also supports compressing data to minimize size. PDF2JSON is available for Windows, OSX and Linux. Please see https://flowpaper.com for more information
305 stars 52 forks source link

Tabs in text #7

Open acurrieclark opened 8 years ago

acurrieclark commented 8 years ago

Hello all, I have found an odd scenario whereby tabs in the PDF text result in no output. For example: This->is->separated->by->tabs results in Thisisseparatedbytabs

I am happy to look at the code if someone might be able to point me at where the text stream is handled.