pymupdf / PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
https://pymupdf.readthedocs.io
GNU Affero General Public License v3.0
5.17k stars 495 forks source link

Document suppressing line breaks #3879

Open JorjMcKie opened 1 week ago

JorjMcKie commented 1 week ago

This changes the documentation to point to improved plain text output. Also included is an example for combining this improvement with PyMuPDF4LLM utilities.

@jamie-lemon please let me know if you think that more examples should be added ...