metachris / pdfx

Extract text, metadata and references (pdf, url, doi, arxiv) from PDF. Optionally download all referenced PDFs.
http://www.metachris.com/pdfx
Apache License 2.0
1.03k stars 113 forks source link

Point pdf links to local files downloaded - feature request #56

Open maguilella opened 2 years ago

maguilella commented 2 years ago

Is there any possibility that the original pdf file be modified to make the original link to point to the locally downloaded files? A second, more interesting option would be to combine all pdf's in a single one and change every link to point internally at the specified page. That would be interesting for example to save documents (PhD Tesis, Master Tesis, etc) in a single document that can be saved for long time without losing content.

fidian commented 2 years ago

I'm also looking for a way to rewrite URLs in a PDF file, though my use case would be to update links to point to a new location only. Any tool or method that does this would be wonderful.