jzillmann / pdf-to-markdown

A PDF to Markdown converter
https://pdf2md.morethan.io
MIT License
1.17k stars 189 forks source link

is there any option to convert pdf to mark down with embedded images #3

Open jayanh opened 7 years ago

jayanh commented 7 years ago

Hi! This version can only convert to text, so is there any way/option to convert with media(images...)

Thanks

jzillmann commented 7 years ago

Currently not, sorry. I've thought about it... think pdf.js allows to extract media... but haven't tried it and for my use it was irrelevant...

marky-mark commented 4 years ago

Table data would be great too

berserkwarwolf commented 3 years ago

@jzillmann Would you accept a bounty for this feature ?

jzillmann commented 3 years ago

@berserkwarwolf What exactly ?

  1. You want media extracted ?
  2. You want media extracted and included in the markdown as links (probably downloaded as a folder) ?
  3. Table data ?
flywire commented 3 years ago

https://pdfbox.apache.org/

REM for %f in (*.pdf) do extract "%f"
java -jar pdfbox-app-2.0.24.jar ExtractImages %1