issues
search
jsvine
/
pdfplumber
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
MIT License
6.02k
stars
619
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Get the Text associated with the hyperlinks - PdfPlumber
#940
mukundhareddy1996
closed
11 months ago
2
Add support for structure tree and marked content sections
#937
dhdaines
closed
11 months ago
9
v0.10.0
#936
jsvine
closed
11 months ago
1
TypeError: argument of type 'PDFObjRef' is not iterable
#935
caolf
opened
12 months ago
8
char' top and bottom attributes exceed page's bbox
#932
kalelsun
closed
12 months ago
2
how to get colspan or rowspan info in the table?
#927
tujinshu
closed
1 year ago
1
can table exacted without lines
#925
tujinshu
closed
1 year ago
0
extract table cross two pages
#922
tujinshu
closed
1 year ago
2
Incorrect extraction in tables
#921
tujinshu
closed
1 year ago
0
Incorrect row number when extract tables
#920
tujinshu
closed
1 year ago
0
Test extract
#919
michaelwnau
closed
1 year ago
0
colours have inconsistent types
#917
dhdaines
closed
11 months ago
14
Wrong extraction of nested cropped page with relative flag
#914
SS-035
closed
11 months ago
2
Incorrect extraction in tables with overlapping columns
#912
gnadlr
opened
1 year ago
20
Accessibility tagging
#909
NathanTech7713
opened
1 year ago
16
Evaluate `pypdfium` as potential replacement of `Wand` for PDF->image conversion
#906
jsvine
closed
11 months ago
1
Add `normalize_unicode=False/True` parameter to text extraction methods
#905
jsvine
opened
1 year ago
1
can i merge two or more CroppedPage into one?
#900
chopin1998
closed
1 year ago
1
How to extract table in tables ?
#898
caolf
closed
1 year ago
1
Unnecessary spaces added at the middle of word
#896
alzambranolu13
opened
1 year ago
7
Different LAParams for different zones
#893
QuentinAndre11
closed
1 year ago
1
obtaining pictures in PDF
#888
willzgr
closed
1 year ago
6
When I use extract_text and extract_words the input is empty
#887
Mankvis
closed
1 year ago
1
Multiple Tables of banded shaded rows with varying number of lines in row
#882
ramakrse
closed
1 year ago
0
Image in PDF is recognized as Table
#881
ramakrse
closed
1 year ago
2
Simply running example, but engage TypeError
#879
coolinstar
closed
1 year ago
1
Update README.md
#877
RitchieP
closed
1 year ago
1
extract_words throws unsupported operand type(s) for +: 'PSLiteral' and 'list'
#874
Laubeee
closed
1 year ago
1
Add `Page.find_table(...)`
#873
jsvine
closed
11 months ago
1
Two tables on the same page are extracted as one
#871
filips123
closed
1 year ago
0
Segmentation Fault in running tests
#869
petermr
closed
1 year ago
9
v0.9.0
#862
jsvine
closed
1 year ago
1
page.rects get wrong results
#860
NextGuido
closed
1 year ago
0
update to the correct key name
#856
weartist
closed
1 year ago
1
update to the correct key name
#855
weartist
closed
1 year ago
0
Doesn't work for rotated page
#848
Tobeabellwether
opened
1 year ago
2
How do you get the position index of the table on the page
#847
Godlikemandyy
closed
1 year ago
0
hyperlinks have negative height
#845
bentsi
opened
1 year ago
1
Is there any method to remove the header and footer of a pdf?
#843
154192
closed
1 year ago
1
dedupe_chars() method get error
#842
154192
closed
1 year ago
2
Correct placement of the `close()` method in the documentation
#835
samkit-jain
closed
1 year ago
1
Documentation of `close()`
#834
sujayvadlakonda
closed
1 year ago
2
AttributeError: 'bytes' object has no attribute 'seek'
#833
jiarongkoh
closed
1 year ago
1
Consider making `wand` optional to avoid AGPL
#832
mjbommar
closed
6 months ago
7
Possibility of text extraction using coordinates
#830
sandeepreddy5
closed
1 year ago
0
The data is in a box. How to access the data in tabular form?
#829
abhijit-z
closed
1 year ago
1
PSLiteral object using non_stroking_color from a rectangle
#828
luanmota
opened
1 year ago
6
TypeError: unsupported operand type(s) for %: 'NoneType' and 'int' when trying to access PDF page objects
#827
thefirebanks
closed
1 year ago
1
Add `repair` method?
#824
jsvine
closed
11 months ago
8
AttributeError: module 'pdfplumber' has no attribute 'load'
#821
juzj
closed
1 year ago
1
Previous
Next