alephdata / ingest-file

Ingestors extract the contents of mixed unstructured documents into structured (followthemoney) data.
GNU Affero General Public License v3.0
54 stars 26 forks source link

Bump pymupdf from 1.21.1 to 1.24.9 #645

Open dependabot[bot] opened 1 month ago

dependabot[bot] commented 1 month ago

Bumps pymupdf from 1.21.1 to 1.24.9.

Release notes

Sourced from pymupdf's releases.

PyMuPDF-1.24.9 released

PyMuPDF-1.24.9 has been released.

Wheels for Windows, Linux and MacOS, and the sdist, are available on pypi.org and can be installed in the usual way, for example:

python -m pip install --upgrade pymupdf

[Linux-aarch64 wheels will be built and uploaded later.]

Changes in version 1.24.9 (2024-07-24)

  • Incremented MuPDF version to 1.24.8.

PyMuPDF-1.24.8 released

PyMuPDF-1.24.8 has been released.

Wheels for Windows, Linux and MacOS, and the sdist, are available on pypi.org and can be installed in the usual way, for example:

python -m pip install --upgrade pymupdf

[Linux-aarch64 wheels will be built and uploaded later.]

Changes in version 1.24.8 (2024-07-22)

Other:

  • Fixed various spelling mistakes spotted by codespell.
  • Improved how we modify MuPDF's default configuration on Windows.
  • Make text search to work with ligatures.

PyMuPDF-1.24.7 released

PyMuPDF-1.24.7 has been released.

Wheels for Windows, Linux and MacOS, and the sdist, are available on pypi.org and can be installed in the usual way, for example:

python -m pip install --upgrade pymupdf

... (truncated)

Changelog

Sourced from pymupdf's changelog.

Change Log

Changes in version 1.24.9 (2024-07-24)

  • Incremented MyPDF version to 1.24.9.

Changes in version 1.24.8 (2024-07-22)

  • Fixed issues:

    • Fixed 3636 <https://github.com/pymupdf/PyMuPDF/issues/3636>_: API documentation for the open function is not obvious to find.
    • Fixed 3654 <https://github.com/pymupdf/PyMuPDF/issues/3654>_: docx parsing was broken in 1.24.7
    • Fixed 3677 <https://github.com/pymupdf/PyMuPDF/issues/3677>_: Unable to extract subset font name using the newer versions of PyMuPDF : 1.24.6 and 1.24.7.
    • Fixed 3687 <https://github.com/pymupdf/PyMuPDF/issues/3687>_: Page.get_text results in AssertionError for epub files

Other:

  • Fixed various spelling mistakes spotted by codespell.
  • Improved how we modify MuPDF's default configuration on Windows.
  • Make text search to work with ligatures.

Changes in version 1.24.7 (2024-06-26)

  • Fixed issues:

    • Fixed 3615 <https://github.com/pymupdf/PyMuPDF/issues/3615>_: Document.pagemode or Document.pagelayout crashes for epub files
    • Fixed 3616 <https://github.com/pymupdf/PyMuPDF/issues/3616>_: not last version reported

Changes in version 1.24.6 (2024-06-25)

  • Fixed issues:

    • Fixed 3599 <https://github.com/pymupdf/PyMuPDF/issues/3599>_: Story.fit_width() has a weird line
    • Fixed 3594 <https://github.com/pymupdf/PyMuPDF/issues/3594>_: Garbled extraction for Amazon Sustainability Report
    • Fixed 3591 <https://github.com/pymupdf/PyMuPDF/issues/3591>_: 'width' in Page.get_drawings() returns width equal as 0
    • Fixed 3561 <https://github.com/pymupdf/PyMuPDF/issues/3561>_: ZeroDivisionError: float division by zero with page.apply_redactions()
    • Fixed 3559 <https://github.com/pymupdf/PyMuPDF/issues/3559>_: SegFault 11 when empty H1 H2 H3 H4 etc element is used in insert_htmlbox
    • Fixed 3539 <https://github.com/pymupdf/PyMuPDF/issues/3539>_: Add dotted gridline detection to table recognition
    • Fixed 3519 <https://github.com/pymupdf/PyMuPDF/issues/3519>_: get_toc(simple=False) AttributeError: 'Outline' object has no attribute 'rect'
    • Fixed 3510 <https://github.com/pymupdf/PyMuPDF/issues/3510>_: page.get_label() gets wrong label on the first page of doc
    • Fixed 3494 <https://github.com/pymupdf/PyMuPDF/issues/3494>_: 1.24.2/1.24.3: spurious characters introduced when using subset_fonts and insert_pdf
    • Fixed 3470 <https://github.com/pymupdf/PyMuPDF/issues/3470>_: subset_fonts error exit without exception/warning
    • Fixed 3400 <https://github.com/pymupdf/PyMuPDF/issues/3400>_: set_toc alters link coordinates for some rotated pages on pymupdf 1.24.2
    • Fixed 3347 <https://github.com/pymupdf/PyMuPDF/issues/3347>_: Incorrect links to points on pages having different heights
    • Fixed 3237 <https://github.com/pymupdf/PyMuPDF/issues/3237>_: Set_metadata() does not work

... (truncated)

Commits
  • 9717935 Update changelog, version numbers and release dates for release 1.24.9.
  • 0f45692 changes.txt: fixed formatting of 1.24.8 description.
  • c3bed87 src/init.py: fix to work with latest mupdf 1.24.8.
  • e4420d1 setup.py: fix generation of mupdf header tree.
  • 015adf1 READMEb.md READMEd.md: removed unnecessary version numbers.
  • 0cc33da setup.py: force creation of new pymupdfb wheels.
  • 2216b7b Update changelog, version numbers and release dates for release 1.24.8.
  • 573fcd0 pipcl.py: use sysconfig.get_platform() instead of setuptools.distutils.util.g...
  • f49f756 src/init.py: log(): make slightly more robust.
  • 550411c Documentation: Updates sidebar logo to use SVG.
  • Additional commits viewable in compare view


Dependabot compatibility score

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


Dependabot commands and options
You can trigger Dependabot actions by commenting on this PR: - `@dependabot rebase` will rebase this PR - `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it - `@dependabot merge` will merge this PR after your CI passes on it - `@dependabot squash and merge` will squash and merge this PR after your CI passes on it - `@dependabot cancel merge` will cancel a previously requested merge and block automerging - `@dependabot reopen` will reopen this PR if it is closed - `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually - `@dependabot show ignore conditions` will show all of the ignore conditions of the specified dependency - `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)