atlanhq / camelot

Camelot: PDF Table Extraction for Humans
https://camelot-py.readthedocs.io
Other
3.65k stars 355 forks source link

OSError: Ghostscript Installation error #489

Open elvinagam opened 2 years ago

elvinagam commented 2 years ago

Even if i am using all these on Google colab to correctly install camelot, I am getting error when reading pdfs. It should not be this hard to install a simple library and run one line of code. If there is other way around, please let me know.

pip uninstall camelot
pip uninstall camelot-py
pip install camelot-py[cv]

# install ghostscript
! apt install ghostscript python3-tk
! pip install ghostscript

Error log from Google COLAB OSError: Ghostscript is not installed. You can install it using the instructions here: https://camelot-py.readthedocs.io/en/master/user/install-deps.html

krishnasism commented 1 year ago

@elvinagam were you able to solve this?

elvinagam commented 1 year ago

@krishnasism we forked camelot and removed ghostscript dependency as it started acting up a lot. You can go ahead and use this one:

https://github.com/h2oai/camelot

There is a slight difference with installation.

bosd commented 1 month ago

Hey all!

We try to build a maintained fork at pypdf_table_extraction.

You are welcome to check it out and contribute there.