-
Is there a way to include arbitrary files as attachment? In IText, this could by done using `com.lowagie.text.pdf.PdfFileSpecification.fileEmbedded`. Does anyone know if this can be done with PDFBOX a…
-
Implement new basic previewers
```[tasklist]
### Tasks
- [ ] better text previewer (current one shows all the text in one narrow column)
- [ ] csv (ability to sort?)
- [ ] json
- [ ] pdf
- [ …
-
I'm trying to parse a PDF using the example, but parsing a small 209 kb file requires more than 5 seconds.
```
using namespace docwire;
std::stringstream out_stream;
std::filesystem::path("D:\\pdf…
-
### Description of the bug
I want to remove all texts and only keep vector graphics (such as straight lines) in PDF, the code and result are shown below.
However, the original PDF does not contain…
-
Hello,
My PDF file contains long tables, and the tables include images. I tried
`md_text = pymupdf4llm.to_markdown("input.pdf", write_images=True)`
and the result was that the images were ext…
-
Use sample pdf https://mozilla.github.io/pdf.js/web/viewer.html or https://mozilla.github.io/pdf.js/legacy/web/viewer.html .
Steps to reproduce the problem:
1. In mobile device, I was using Samsun…
-
I was directed to this repo in order to make a request for an nth fragment selector.
The problem I am trying to solve is to be able to select and update the CSS of tables split across PDF pages. Fo…
-
**Describe the bug**
When I use the pdftotext plugin to convert PDFs, the result obtained is completely distorted compared to the original PDF. However, I noticed that by passing the "-layout" parame…
-
### Steps to reproduce
1. Enable pdf editing
2. Set 'add highlight to file directly'
3. Highlight a passage
### Expected behavior
The pdf is saved without any changes apart from the highli…
-
Need to parse pdf to text. So a parser is needed. Is working on a parser for android in these days.