jsvine / pdfplumber

Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
MIT License
6.57k stars 659 forks source link

Comparision of pdfplumber with related libraries #257

Closed MartinThoma closed 4 years ago

MartinThoma commented 4 years ago

The PDF tooling world is huge. For a beginner it is hard to tell which library to use. It would be nice to have a comparison.

Is it possible to combine tools, e.g. use logic from camelot to recognize tables and create a HTML output which has semantic HTML tables instead of absolutely positioned cells? (related question on Stackoverflow)

MartinThoma commented 4 years ago

Thank you so much! I think once this PR is merged, this issue can be closed :-)

jsvine commented 4 years ago

Thanks @MartinThoma, I thought that was a great idea. I've added a brief comparison, as you can see, and will close this issue. But if there are any lingering questions about the differences, don't hesitate to add a comment here.