pdftables / python-pdftables-api

Python library to interact with https://pdftables.com API
https://pdftables.com/api
BSD 3-Clause "New" or "Revised" License
85 stars 33 forks source link

Does this work for scanned pdfs? #10

Closed ajithvcoder closed 6 years ago

ajithvcoder commented 6 years ago

its working for pdf files that are typed pdf but for scanned pdf output excel file is corrupted.

StevenMaude commented 6 years ago

Hi!

PDFTables.com doesn't currently extract scanned documents; however, as the linked page mentions, you can try running your PDF through OCR software first and then submitting it to PDFTables.com.

Hope that helps 🙂

ajithvcoder commented 6 years ago

Thanks will try that method