jsvine/pdfplumber
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
MIT License
6.1k stars
625 forks
issues
TypeError: unsupported operand type(s) for %: 'NoneType' and 'int' when trying to access PDF page objects
#827
thefirebanks
closed
1 year ago
1
Add `repair` method?
#824
jsvine
closed
1 year ago
8
AttributeError: module 'pdfplumber' has no attribute 'load'
#821
juzj
closed
1 year ago
1
Extracting tables with `explicit_vertical_lines` returns more columns than expected
#820
erip
closed
1 year ago
5
pdfplumber extracting wrong text from pdf
#815
siddhantajain
closed
1 year ago
1
fix: calculate rotation using 0 when Rotate is None
#811
toshi1127
closed
1 year ago
5
Extract tables randomly moves the bbox 20 pixels away
#810
merionum
closed
1 year ago
1
find_tables takes several minutes for one page
#807
buptyyf
closed
1 year ago
5
Not reading the pdf file
#803
drnko
closed
1 year ago
0
are the default config for extracting text/tables the best ones?
#801
sergenti
closed
1 year ago
0
Wrong coordinates of words when using function extract_words()
#799
datdao1998
closed
1 year ago
6
to_image doesn't accept parameter "width"
#798
pseudomonas
closed
1 year ago
6
AttributeError: partially initialized module 'pdfplumber' has no attribute 'open' (most likely due to a circular import)
#796
amanrajpoot101
closed
1 year ago
1
Bug of the extract_table() function
#795
Jack251970
closed
1 year ago
1
Wrong Ordering of RTL Text And Table Extractions
#794
ramzitannous
opened
1 year ago
3
fix typing hints to include io.BytesIO
#791
conitrade-as
closed
1 year ago
2
Cannot .to_image() a FilteredPage class instance.
#784
jamiejcole
closed
1 year ago
2
No spaces extracted by chars
#780
mon-hur
closed
1 year ago
5
Too many edges detected
#779
shangrilar
closed
1 year ago
1
Update README.md
#774
wltrhslu
closed
1 year ago
2
It might be character set problem
#766
Gadil-1987
closed
1 year ago
1
Getting 'pdfplumber' has no attribute 'open' Error in latest version.
#765
Mayank-21
closed
1 year ago
2
Plumbing pdf results in mixed characters of neighbouring words
#764
XuShanJiang
closed
1 year ago
7
Some additional documentation in `pdfplumber.Page.to_image(**conversion_kwargs)`?
#760
jwestwsj
closed
1 year ago
7
Extract digital signatures from pdf
#756
FaizanCW
opened
1 year ago
3
Inclusion of a CITATION.cff file
#755
joaoccruz
closed
1 year ago
4
Reading different words as one continuous word.
#752
ameymn
closed
1 year ago
2
New Problem
#745
Godlikemandyy
closed
1 year ago
1
Detecting paragraphs or blank lines inside a table
#733
Cristishor201
closed
1 year ago
1
Update README.md
#732
tjex
closed
1 year ago
2
How to get information of superscript and subscript text in the pdf
#731
sarveshkrg
closed
1 year ago
1
TypeError: unsupported operand type(s) for -: 'float' and 'NoneType'
#726
loganathanspr
closed
1 year ago
4
Distinguish between bold and non-bold Fonts
#724
lycfight
opened
1 year ago
6
The same table is distributed on two pages, and some data extraction fails
#720
AresElvis
closed
1 year ago
0
Information about how to display without Jupyter
#716
josephernest
closed
1 year ago
3
pdf plumber to_image( ) OSError: exception: access violation writing 0x0000000000000008
#713
jjjkuba
closed
1 year ago
5
automatically make this space read as ""
#709
yihaoshumi
closed
1 year ago
0
Mypy compatibility
#703
jhonatan-lopes
closed
1 year ago
3
Extracting Z-Value of Rects/Items
#700
JosefJoubert
closed
1 year ago
1
AttributeError: partially initialized module 'pdfplumber' has no attribute 'open' (most likely due to a circular import)
#699
lili1234567890
closed
1 year ago
6
pip whl missing `py.typed`
#698
jhonatan-lopes
closed
1 year ago
6
`ValueError: bytes must be in range(0, 256)` in page.chars
#695
bpugnaire
closed
2 years ago
1
AttributeError: 'LTChar' object has no attribute 'graphicstate' trying to use the table function
#692
Da-vid21
closed
2 years ago
2
_itemgetter function removed from utils.py without deprecationWarning
#691
jfuruness
closed
2 years ago
6
Handle `ValueError` exception when searching for text using regex
#687
samkit-jain
closed
2 years ago
2
How to get merged cells
#685
hbh112233abc
opened
2 years ago
9
Page.search results bbox position can be wrong
#684
bpugnaire
closed
2 years ago
4
Page.search ValueError: min() arg is an empty sequence
#683
bpugnaire
closed
2 years ago
5
Consider punctuation when extracting words
#682
lolipopshock
closed
2 years ago
3
pdfplumber hangs when opening a damaged PDF
#681
Gadil-1987
closed
2 years ago
8