alephdata / ingest-file

Ingestors extract the contents of mixed unstructured documents into structured (followthemoney) data.
GNU Affero General Public License v3.0
54 stars 25 forks source link

Bump pymupdf from 1.21.1 to 1.23.26 #589

Closed dependabot[bot] closed 4 months ago

dependabot[bot] commented 5 months ago

Bumps pymupdf from 1.21.1 to 1.23.26.

Release notes

Sourced from pymupdf's releases.

PyMuPDF-1.23.26 released

PyMuPDF-1.23.26 has been released.

Wheels for Windows, Linux and MacOS, and the sdist, are available on pypi.org and can be installed in the usual way, for example:

python -m pip install --upgrade pymupdf

[Linux-aarch64 wheels will be built and uploaded later.]

Changes in version 1.23.26 (2024-02-29)

  • Fixed issues:

  • Other:

    • Improvements to table detection:

      • Improved check for empty tables, fixes bugs when determining table headers.
      • Improved computation of enveloping vector graphic rectangles.
      • Ignore more meaningless "pseudo" tables
    • Install command-line 'pymupdf' command that runs fitz/main.py.

    • Don't overwrite MuPDF's config.h when building on non-Windows.

    • Fix Story constructor's archive arg to match docs - now accepts a single Archive constructor arg.

    • Do not include MuPDF source in sdist; will be downloaded automatically when building.

PyMuPDF-1.23.25 released

PyMuPDF-1.23.25 has been released.

Wheels for Windows, Linux and MacOS, and the sdist, are available on pypi.org and can be installed in the usual way, for example:

python -m pip install --upgrade pymupdf

[Linux-aarch64 wheels will be built and uploaded later.]

Changes in version 1.23.25 (2024-02-20)

... (truncated)

Changelog

Sourced from pymupdf's changelog.

Change Log

Changes in version 1.23.26 (2024-02-29)

  • Fixed issues:

    • Fixed 3199 <https://github.com/pymupdf/PyMuPDF/issues/3199>_: Add entry_points to setuptools configuration to provide command-line console scripts
    • Fixed 3209 <https://github.com/pymupdf/PyMuPDF/issues/3209>_: Empty vertices in ink annotation
  • Other:

    • Improvements to table detection:

      • Improved check for empty tables, fixes bugs when determining table headers.
      • Improved computation of enveloping vector graphic rectangles.
      • Ignore more meaningless "pseudo" tables
    • Install command-line 'pymupdf' command that runs fitz/main.py.

    • Don't overwrite MuPDF's config.h when building on non-Windows.

    • Fix Story constructor's archive arg to match docs - now accepts a single Archive constructor arg.

    • Do not include MuPDF source in sdist; will be downloaded automatically when building.

Changes in version 1.23.25 (2024-02-20)

  • Fixed issues:

    • Fixed 3182 <https://github.com/pymupdf/PyMuPDF/issues/3182>_: Pixmap.invert_irect argument type error
    • Fixed 3186 <https://github.com/pymupdf/PyMuPDF/issues/3186>_: extractText() extracts broken text from pdf
    • Fixed 3191 <https://github.com/pymupdf/PyMuPDF/issues/3191>_: Error on .find_tables()
  • Other:

    • When building, be able to specify python-config directly, with environment variable PIPCL_PYTHON_CONFIG.

Changes in version 1.23.24 (2024-02-19)

  • Fixed issues:

    • Fixed 3148 <https://github.com/pymupdf/PyMuPDF/issues/3148>_: Table extraction - vertical text not handled correctly
    • Fixed 3179 <https://github.com/pymupdf/PyMuPDF/issues/3179>_: Table Detection: Incorrect Separation of Vector Graphics Clusters
    • Fixed 3180 <https://github.com/pymupdf/PyMuPDF/issues/3180>_: Cannot show optional content group: AttributeError: module 'fitz.mupdf' has no attribute 'pdf_array_push_drop'
  • Other:

    • Be able to test system install using sudo pip install instead of a venv.

... (truncated)

Commits
  • a868c0a Update changelog, version numbers and dates for release 1.23.26.
  • 9552f5e Fix bug in Page.add_ink_annot().
  • aae7d1b setup.py: do not include MuPDF .tgz in sdist to reduce its size.
  • df4139a Fix Story constructor's archive arg to match docs.
  • 3b0ab8d setup.py: Only overwrite MuPDF's include/mupdf/fitz/config.h on Windows.
  • ad3f92b Immunize against empty table headers
  • c41f831 changes.txt: update.
  • 6acf394 tests/test_font.py: added test_mupdf_subset_fonts2().
  • 20f35ac setup.py tests/: install a pumupdf command that runs fitz/main.py.
  • 5caba13 pipcl.py: added support for entry_points.
  • Additional commits viewable in compare view


Dependabot compatibility score

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


Dependabot commands and options
You can trigger Dependabot actions by commenting on this PR: - `@dependabot rebase` will rebase this PR - `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it - `@dependabot merge` will merge this PR after your CI passes on it - `@dependabot squash and merge` will squash and merge this PR after your CI passes on it - `@dependabot cancel merge` will cancel a previously requested merge and block automerging - `@dependabot reopen` will reopen this PR if it is closed - `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually - `@dependabot show ignore conditions` will show all of the ignore conditions of the specified dependency - `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)
dependabot[bot] commented 4 months ago

Superseded by #604.