modesty / pdf2json

converts binary PDF to JSON and text, for server-side PDF processing and command-line use.
https://github.com/modesty/pdf2json
Other
2.01k stars 377 forks source link

Get picture data in pdf? #256

Open bigbird231 opened 2 years ago

bigbird231 commented 2 years ago

I want to get picture data from pdf. now I can only get text data from pdf by pdf2json,is it possible to get picture data? any ideas,thankyou!

reregaga commented 2 years ago

@bigbird231, no, still under development: https://github.com/modesty/pdf2json/pull/91#issuecomment-979430735 I vote for your proposal.

You can use other libs for getting images, for example mupdf-js can convert pdf to svg and then search in svg image tags.