PDF2JSON is a conversion library based on XPDF (3.02) which can be used for high performance PDF page by page conversion to JSON and XML format. It also supports compressing data to minimize size. PDF2JSON is available for Windows, OSX and Linux. Please see https://flowpaper.com for more information
305
stars
52
forks
source link
The converter is modified to extract font details of each word includ… #15
…ing fontname,fontsize,bold,italic,oblique,color .