flexpaper / pdf2json

PDF2JSON is a conversion library based on XPDF (3.02) which can be used for high performance PDF page by page conversion to JSON and XML format. It also supports compressing data to minimize size. PDF2JSON is available for Windows, OSX and Linux. Please see https://flowpaper.com for more information

Co-ordinates seem to have no relation to co-ordinates for object #9

Open paulatwilson opened 8 years ago

paulatwilson commented 8 years ago

I have been trying to work out where objects are on the page, and your Height and Width for the page make no sense at all. For example, I just had a look and my PDF is H: 842 x W: 959, but your software is reporting it as H: 1262 x W: 892.

The same goes for elements; I cannot work any of this out.

Top in the PDF says: 204, but the export from your software says: 963. Left in Acrobat says 219, yours says: 332. I cannot see any relationship at all between any of it. Please help!

I don't know C++ so I am unable to understand exactly what this is doing and how it is calculating things.

Paul

flexpaper commented 7 years ago

The pdf2json library exports pages at scale 1.5 by default; that's why you're seeing these differences.
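
As a rough illustration of that scale factor, here is a minimal Python sketch that divides the exported values quoted in this thread by 1.5 to get back to PDF points. The helper names `to_pdf_points` and `flip_y` are made up for this example and are not part of pdf2json. The vertical flip is an assumption: PDF user space normally measures y from the bottom of the page, so it only applies if the exported "top" is measured from the top edge.

```python
SCALE = 1.5  # default export scale mentioned in the reply above


def to_pdf_points(value, scale=SCALE):
    """Convert an exported coordinate or dimension back to PDF points."""
    return value / scale


def flip_y(top_points, page_height_points):
    """Hypothetical helper: turn a distance from the top edge into a
    distance from the bottom edge (assumes the export uses a top-left origin)."""
    return page_height_points - top_points


# Values reported in the issue
page_height = to_pdf_points(1262)  # about 841.3
page_width = to_pdf_points(892)    # about 594.7
left = to_pdf_points(332)          # about 221.3
top = to_pdf_points(963)           # 642.0

print(round(page_height, 1), round(page_width, 1))
print(round(left, 1), round(top, 1))
print(round(flip_y(top, page_height), 1))  # about 199.3, if the y axis is flipped
```

With these numbers the height comes out close to the 842 reported for the PDF, and the left offset lands near the 219 shown in Acrobat. The vertical value only comes close to 204 after the flip, which suggests (but does not prove) that the two tools measure y from opposite edges. Small differences may come from rounding or from which edge of the text box each tool reports.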