jsvine / pdfplumber

Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
MIT License
6.1k stars 625 forks

Extract text without tables #314

Closed Spiritus44 closed 3 years ago

Spiritus44 commented 3 years ago

Hello,

First, thank you for this great library; it really helps me process PDF files quickly.

I'm working on a PDF summarizer and I use pdfplumber to export all the text of a PDF. Unfortunately, some of my documents contain tables, and (to make the summarization step easier) I want to export only the text, without the table cells.

In your opinion, what is the best way to do that?

Thanks a lot for your answer.

samkit-jain commented 3 years ago

Hi @Spiritus44, thanks for your interest in the library. Please have a look at issue #242, which covers extracting text while excluding tables.
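
For readers without that thread at hand, one way to approach this might look like the sketch below. It is not a quote from issue #242: it assumes that `page.find_tables()` detects the tables in your documents with the default settings, and it drops every object whose center falls inside a detected table's bounding box before extracting text. The helper names `center_in_bbox` and `extract_text_without_tables` are made up for illustration.

```python
def center_in_bbox(obj, bbox):
    """Return True if the center of a pdfplumber object lies inside bbox.

    obj is a dict with "x0", "x1", "top" and "bottom" keys, as pdfplumber
    uses for chars, lines, rects, etc. bbox is (x0, top, x1, bottom).
    """
    x0, top, x1, bottom = bbox
    h_mid = (obj["x0"] + obj["x1"]) / 2
    v_mid = (obj["top"] + obj["bottom"]) / 2
    return x0 <= h_mid < x1 and top <= v_mid < bottom


def extract_text_without_tables(pdf_path):
    """Hypothetical helper: text of every page, skipping detected tables."""
    # Imported here so the geometry helper above has no dependency.
    import pdfplumber

    texts = []
    with pdfplumber.open(pdf_path) as pdf:
        for page in pdf.pages:
            # Bounding boxes of the tables found with default settings.
            bboxes = [table.bbox for table in page.find_tables()]

            def outside_tables(obj, bboxes=bboxes):
                # Keep only objects whose center is in none of the tables.
                return not any(center_in_bbox(obj, b) for b in bboxes)

            filtered_page = page.filter(outside_tables)
            # extract_text() may return None on some versions for empty pages.
            texts.append(filtered_page.extract_text() or "")
    return "\n".join(texts)
```

Calling `extract_text_without_tables("report.pdf")` (the file name is a placeholder) would return the remaining text as one string. If tables are missed or over-detected, the table settings passed to `find_tables()` probably need tuning for the documents in question.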