-
For example
https://www.iuoe399.org/media/filer_public/45/77/457700c9-dd70-4cfc-be49-a81cb3fba0a6/2020_lu399_primary_endorsement.pdf
This issue requires the installation of pdfminer.six on the p…
-
I'm using pdfplumber to extract text from pdf pages using the crop function. It works fine on the local system but when deploying the project in a docker container it gives the Error: `'LTChar' object…
-
**Bug report**
When passing `boxes_flow` as `None`, we don't run the full advanced layout analysis, but rather the order of text boxes will depend on their position on the page only. This is intent…
-
### Current functionality
> If characters are on the same line but not part of the same word, an intermediate space is inserted.
[https://pdfminersix.readthedocs.io/en/latest/api/composable.html#l…
-
I'm feel so stupid to say this question but i have no result after a lot of test.
I trying to create a pattern image representing textes positions with rectangles but when i use bbox coordonates, i h…
-
Per documentation: https://pdfminersix.readthedocs.io/en/latest/reference/highlevel.html#api-extract-text
```
extract_text
pdfminer.high_level.extract_text(pdf_file, password='', page_numbers=None,…
-
I do have a proper working code for this situation, but we have to make sure [pdf miner](https://pypi.org/project/pdfminer/) is installed.
I do not intend to put pdfminer into requirements, as it's…
-
## MY python3.75 code:
`pdf = pdfplumber.open(r'abcd.pdf')
p0 = pdf.pages[0]
text = p0.extract_text()
print(text)
tables= p0.extract_tables()
print(tables)
`
the text result is None
the tab…
-
## Describe the bug
The table is extracted correctly in visualization, but the text in the table is not extracted.
## Code to reproduce the problem
The code is as follows:
pdf = pdfplumber…
-
## What are you trying to do?
I am using pdfplumber to look for 12 digit strings in a PDF. My code worked when the font was Helvetica, but stopped working when I changed font to stsong-light
#…