-
Govdocs -
[000899.pdf](https://github.com/trailofbits/polyfile/files/4882183/000899.pdf)
[001940.pdf](https://github.com/trailofbits/polyfile/files/4882184/001940.pdf)
```
Parsing PDF obj 62 …
-
Hello Team,
Really like your work on LlamaParse. The web app is working fine for PDF parsing but not the package.
Even when a new cloud API key is used, I got the same error.
```python
import…
-
var pdfReader = hummus.createReader(sourcePath);
pageNumber=pdfReader.getPagesCount()
-
```
What steps will reproduce the problem?
1. in applet use method appendPDF and specify not existing URL
2. jzebraDoneAppending() will fire, but there is no exception in
applet.getException() (you …
-
Hello,
Thanks a lot for the huge work on unstructured !
I would love to visualize with a **progress bar the advancement of partition_pdf when parsing big pdfs.**
Is there a easy way of doing …
-
from qanything_kernel.utils.loader.self_pdf_loader import PdfLoader
pdf_loader = PdfLoader(filename='tables/table-03d9ec345317b0115180d7dbcf843ef6.pdf')
markdown_directory = pdf_loader.load_to_markd…
-
Ahoj,
v MZK sme narazili na bug pri generovaní PDF. Momentálne máme nastavený limit pre generovanie PDF 200 stránok. Keď je užívateľ neprihlásený, tak to funguje ako má. K chybe dochádza až keď sa …
-
Are there any alternatives to GROBID and would there be any major advantages in using them?
### Alternatives (feel free to add new entries)
- https://github.com/pdfminer/pdfminer.six
- https://gi…
-
The pdf parsing of https://homepages.cwi.nl/~lex/files/dict.pdf doesn't look very appealing.
Thinks i already noticed
- No TOC display (and strange header size detection, see #21)
- Characters a…
-
Use pdf.js to manually parse an arbitrary pdf currently open