internetarchive / archive-pdf-tools

Fast PDF generation and compression. Deals with millions of pages daily.
https://archive-pdf-tools.readthedocs.io/en/latest/
GNU Affero General Public License v3.0
86 stars 13 forks source link

Support recompressing existing PDFs without hOCR files and without touching the text input #28

Open MerlijnWajer opened 2 years ago

MerlijnWajer commented 2 years ago

This would be quite helpful for OCRmyPDF users if they wanted to aggressively compress their PDFs after OCRmyPDF has done its work, see https://github.com/jbarlow83/OCRmyPDF/issues/541