py-pdf / pypdf_table_extraction

A Python library to extract tabular data from PDFs
https://pypdf-table-extraction.readthedocs.io
MIT License
38 stars 15 forks source link

Do we need a camleot uninstalled in order to use this library? #191

Open dejanmarkovic opened 1 day ago

dejanmarkovic commented 1 day ago

Hi team,

Thank you so much for maintaining this package!

I have a few questions though as I have not found those simple answers in the documentation.

  1. Do we need to uninstall a Camelot in order to use this library?
  2. What is the proper way to use this library

tables = **pypdf_table_extraction**.read_pdf(file_path, pages="all", flavor=network) or tables = **camelo**t.read_pdf(file_path, pages="all", flavor="stream")

  1. I have tried to use the flavor network or hybrid tables = pypdf_table_extraction.read_pdf(file_path, pages="all", flavor=network) and got the error 'An error occurred with stream flavor: name 'network' is not defined'.

Here is the output from the conda list pypdf-table-extraction 0.0.2 pypi_0 pypi

I appreciate your feedback. Thanks!

bosd commented 10 hours ago

Do we need to uninstall a Camelot in order to use this library?

Would like to know this as well. Currently I'm not using them simultaneously. I only use the new library.

What is the proper way to use this library Both the old name as the new one should work.

Afaik, the flavor argument should be formatted as a string with ""

stefan6419846 commented 10 hours ago

In theory you can install both libraries. Nevertheless, you will usually experience side effects: When importing from the camelot namespace, depending on the installation order the package content might differ. Additionally, uninstalling camelot will remove the camelot directory/module shipped by pypdf_table_extraction as well, possibly breaking existing code.