ashrockd closed this 6 days ago
I'm looking for a flag/option along the lines of:
ocrmypdf --write-attributes input.pdf output.pdf
The default behavior is that output.pdf will have a last modified date set to the time ocrmypdf finishes. The file system, the DocumentInfo dictionary, and the XMP metadata will have the same date set.
You may see different behavior if you have pikepdf older than 9.0.0 installed (see https://github.com/pikepdf/pikepdf/issues/595), or if for some reason the operating system does not permit changing timestamps.
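To confirm what your setup actually does, you can compare the file system timestamps of the input and output files. This is a minimal sketch using only the Python standard library; the helper names `modified_after` and `mtime_utc` are made up for illustration, and it checks only the file system date, not the DocumentInfo dictionary or the XMP metadata.

```python
import os
from datetime import datetime, timezone


def modified_after(input_path, output_path):
    """Return True if output_path's file system mtime is later than input_path's."""
    return os.stat(output_path).st_mtime_ns > os.stat(input_path).st_mtime_ns


def mtime_utc(path):
    """Return the file system modification time of path as an aware UTC datetime."""
    return datetime.fromtimestamp(os.stat(path).st_mtime, tz=timezone.utc)
```

For example, after running ocrmypdf on `input.pdf` to produce `output.pdf`, `modified_after("input.pdf", "output.pdf")` should return `True` if the default behavior described above applies.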
Describe the proposed feature
I am indexing scanned files. I indexed them once before OCR. Now, after OCR, I want their content indexed as well, but the utility I am using (docfetcher) simply ignores the already indexed files because it doesn't know that they have been modified with an added text layer.
Is it possible for ocrmypdf to have a flag that updates the last-modified/written attribute?
Here is the discussion on the docfetcher forum.
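If the file system timestamp still does not change (for instance, for one of the reasons mentioned above), a possible workaround is to update it yourself after OCR so that the indexer sees the file as modified. The sketch below is only an assumption about what would help here: `touch_mtime` is a hypothetical helper, not part of ocrmypdf, and it changes only the file system date, not the PDF's internal metadata.

```python
import os
import time


def touch_mtime(path, when=None):
    """Set path's modification time, keeping its access time unchanged.

    when: seconds since the epoch; defaults to the current time.
    Returns the resulting modification time in nanoseconds.
    """
    st = os.stat(path)
    mtime_ns = time.time_ns() if when is None else int(when * 1_000_000_000)
    # os.utime accepts (atime_ns, mtime_ns) via the ns keyword argument.
    os.utime(path, ns=(st.st_atime_ns, mtime_ns))
    return os.stat(path).st_mtime_ns
```

Calling `touch_mtime("output.pdf")` after ocrmypdf finishes would mark the file as modified now; whether docfetcher then re-indexes it depends on how it detects changes.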