-
Hi,
I am trying to extract all words/text as well as the co-ordinates of each word using pdfminer from filled in PDF forms that are no longer editable (i.e. they are flattened and NOT acroforms). I…
-
**Feature request**
- A description of the feature you would like to have
I suggest to add the import statements to the tutorial sections.
- If relevant, the context that you are in. What are…
-
Um den Workflow in Gesundheitsämtern erheblich zu erleichtern, sollen die Meldungen, die deutschlandweit einheitlich über das [Formular](https://www.rki.de/DE/Content/Infekt/IfSG/Meldeboegen/Meldung_L…
-
**Bug report**
How to use section says to run the script [like this](https://github.com/pdfminer/pdfminer.six/blob/develop/README.md#how-to-use): `python pdf2txt.py ...`. However after installing i…
Bouke updated
4 months ago
-
Describe the bug
----------------
I've got a PDF sample where clamav is not able to extract the text, while `pdftotext` (https://poppler.freedesktop.org) and `pdf2txt.py` (https://pdfminersix.rea…
-
Hai,
Thank you for providing a beautiful library.
Actually, I am trying to extract portion of text with respect to the heading like in the sample pdf file we select the heading `ABSTRACT` so as outp…
-
### Describe the issue:
from .numpy_ops import NumpyOps
ImportError: DLL load failed while importing numpy_ops: The specified module could not be found.
### Reproduce the code example:
```python
i…
-
Is there a way to to prevent pdfminer.six from executing the layout algorithm? So that one only gets a list of lines/graphics/image elements etc.. I have several PDFs where the layout algorithm takes…
yeus updated
7 months ago
-
- A description of the bug
Try to extract image from https://raft.github.io/raft.pdf, follow https://pdfminersix.readthedocs.io/en/latest/howto/images.html, but image in output-dir all empty.
- …
zd3tl updated
8 months ago
-
Pdfminer is a python package to convert pdf to text and other formats.
Pdfminer is not actively maintained. There's a similar package name pdfminer.six that is community maintained.
Check out https:…