-
### Version
PyPDFForm=4.3.1
### Issue Description
Hello, I'm trying to fill the attached PDF. I've included the input PDF and the output using the preview stream. I am trying to fill in the fie…
-
This project uses RapidOCR for image OCR and Fitz in the PyMuPDF package for PDF OCR. To be honest, it is extremely difficult to recognize tables in some PDFs, especially in scholarly papers. Therefor…
-
**Describe the bug**
The pypdf_table_extraction version info from the cli / library, does'nt match the pypi version.
(or toml file)
**Steps to reproduce the bug**
install this libr…
-
## Issue Description
If I enable Table-Sorting, the headline of the table in a note exported as a PDF will have black background
## Steps to Reproduce
Please provide a detailed list of steps …
-
## Bug Report
### What happened?
We tried to open PDF (LCP) file both in our app and test app. We were able to successfully open it. However, we experienced an empty Table of Contents . We tri…
-
**Describe the bug**
I am submitting a File containing the tabular data with handwritten text in it with model openai gpt-4o, but when being submitted as an image (jpeg, png etc) it gives accurate re…
-
I'm using Premium Mode to parse complex technical pdfs, with parsing instructions to break the table down by row with labels. It does it properly for the first couple rows of each table, but then sto…
-
I would also like to point out that, the latest version of the library does not reflect as a valid library in VSCode. I had to manually install version 0.7.2 and upgrade it.
The code:
```
import…
-
### Contact Details
_No response_
### Is your feature request related to a problem? Please describe?
The items as they are appearing now in the PDF are somewhat random sorted. Given the example of …
apvlv updated
3 weeks ago
-
Originally opened this as a discussion, but after getting into the code, it appears to be an issue that impacts the extraction of not only tables but also images with text on them.
The problem is …