Belval / pdf2image

A python module that wraps the pdftoppm utility to convert PDF to PIL Image object
MIT License
1.64k stars 195 forks source link

When trying to convert pdf FORM it does not show checkboxes in image. #213

Closed jehato47 closed 6 months ago

jehato47 commented 3 years ago

When trying to convert pdf FORM it does not show checkboxes in image.

To Reproduce RFC-36.pdf

Here is the sample pdf file. If you try to convert it to images it will not show checkboxes in image.

Image of the pdf : output

Belval commented 3 years ago

This is weird, I just tested it and I can confirm that bounding boxes do show up:

out

What's your OS? I am on OS X with pdftoppm v20.12.1

jehato47 commented 3 years ago

I am on windows.

It can perform different in os x.

Belval commented 3 years ago

Try to convert it manually with pdftoppm -r 300 -jpeg your_file.pdf out and see if there are any warnings. Since the installation on Windows is not handled by a package manager it is not uncommon to have missing packages which cause pdf2image to fail silently.

jehato47 commented 3 years ago

Can you try it for this file:

output.pdf

First file was performed well manually. You can see problem in this file. I sent you wrong file.