-
I want to parse a huge, but regular document. Tabula correctly parses the 1st page, but using `--pages all` causes it to consume a huge amount of memory as, I guess, it tries to do processing consider…
-
I have 2 machines. One of which is running Debian 11 and `cups-filters-core-drivers 1.28.7-1+deb11u2`, the other is running Debian 12 and `cups-filters-core-drivers 1.28.17-3`. Connected to both I hav…
-
Für OPUS gibt es Anwenderwünsche, einem PDF beim Download ein Cover PDF voranzustellen, welches z.B. die Nutzungsbedingungen enthält (siehe Projekt [Automatische Deckblätter](https://github.com/OPUS4/…
-
```
In an attempt to use pdfsizeopt to find a "normalized" or "canonical"
representation of PDF files for potential deduplication during backups (or even
for the sake of privacy), it would be nice i…
-
デジタル・フォレンジックとその関連プロセスについて学び、実践的な例で実験する。
Task1 Introduction To Digital Forensics
============================
・フォレンジックとは、犯罪を調査し、事実を確定するために科学を応用することです。コンピュータやスマートフォンなどのデジタルシステムの普及に伴い、犯罪を捜査するための…
-
```
In an attempt to use pdfsizeopt to find a "normalized" or "canonical"
representation of PDF files for potential deduplication during backups (or even
for the sake of privacy), it would be nice i…
-
### Describe the bug
```
Scanning contents ━━━━━━━━━━━━━━━━━━╺━━━━━━━━━━━━━━━━━━━━━ 45% 40/88 0:00:01
An exception occurred while executing the pipeline _common.py:284
…
-
Hi @ffalt thank you a lot for this project. I have successfully been using your `extractBuffer` function in a browser environment.
Working with pdfjs-dist V4.0.269 I noticed that the y coordinate i…
-
The Qucs help menu has always shown just the file name for the PDF help files; I thought it would be nice to actually show the file *Subject*, extracted from the PDF file metadata.
That means inste…
-
```
In an attempt to use pdfsizeopt to find a "normalized" or "canonical"
representation of PDF files for potential deduplication during backups (or even
for the sake of privacy), it would be nice i…