pymupdf / PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
https://pymupdf.readthedocs.io
GNU Affero General Public License v3.0
5.17k stars 495 forks source link

pdf citation #3534

Closed Surajlambor closed 4 months ago

Surajlambor commented 4 months ago

I love using this library for PDF purposes. Now, I'm building a PDF summarization tool. For citations, we need the PDF name and the page number of the retrieved answer. Do you have any ideas on how I can accomplish this? Please help with this