alephdata / ingest-file

Ingestors extract the contents of mixed unstructured documents into structured (followthemoney) data.
GNU Affero General Public License v3.0
54 stars 25 forks source link

Bump pymupdf from 1.21.1 to 1.23.5 #548

Closed dependabot[bot] closed 8 months ago

dependabot[bot] commented 9 months ago

Bumps pymupdf from 1.21.1 to 1.23.5.

Release notes

Sourced from pymupdf's releases.

PyMuPDF-1.23.5 released

PyMuPDF-1.23.5 has been released.

Wheels for Windows, Linux and MacOS, and the sdist, are available on pypi.org and can be installed in the usual way, for example:

python -m pip install --upgrade pymupdf

Changes in version 1.23.5 (2023-10-11)

  • Bug fixes:

  • Other:

    • Use MuPDF-1.23.4.
    • Fix optimisation flags with system installs.
    • Fixed the problem that the clip parameter does not take effect during table recognition
    • Support Pillow mode "RGBa"
    • Support extra word delimiters
    • Support checking valid PDF name objects

PyMuPDF-1.23.4 released

PyMuPDF-1.23.4 has been released.

Wheels for Windows, Linux and MacOS, and the sdist, are available on pypi.org and can be installed in the usual way, for example:

python -m pip install --upgrade pymupdf

Changes in version 1.23.4 (2023-09-26)

  • Improved build instructions.

  • Fixed Tesseract in rebased implementation.

  • Improvements to build/install with system MuPDF.

  • Fixed Pyodide builds.

  • Fixed rebased bug in _insert_image().

  • Bug fixes:

... (truncated)

Changelog

Sourced from pymupdf's changelog.

Change Log

Changes in version 1.23.5 (2023-10-11)

  • Bug fixes:

    • Fixed 2341 <https://github.com/pymupdf/PyMuPDF/issues/2341>_: Handling negative values in the zoom section for LINK_GOTO in linkDest
    • Fixed 2522 <https://github.com/pymupdf/PyMuPDF/issues/2522>_: Typo in set_layer() - NameError: name 'f' is not defined
    • Fixed 2548 <https://github.com/pymupdf/PyMuPDF/issues/2548>_: Fitz freezes on some PDFs when calling the fitz.Page.get_text_blocks method.
    • Fixed 2596 <https://github.com/pymupdf/PyMuPDF/issues/2596>_: save(garbage=3) breaks get_pixmap() with side effect
    • Fixed 2635 <https://github.com/pymupdf/PyMuPDF/issues/2635>_: "clean=True" makes objects invisible in the pdf
    • Fixed 2637 <https://github.com/pymupdf/PyMuPDF/issues/2637>_: Page.insert_textbox incorrectly handles the last word if it starts a new line
    • Fixed 2699 <https://github.com/pymupdf/PyMuPDF/issues/2699>_: extract paragraph with below table
    • Fixed 2703 <https://github.com/pymupdf/PyMuPDF/issues/2703>_: Wrong fontsize calculation in corner cases ("page.get_texttrace()")
    • Fixed 2710 <https://github.com/pymupdf/PyMuPDF/issues/2710>_: page.rect and text location wrong / differing from older version
    • Fixed 2723 <https://github.com/pymupdf/PyMuPDF/issues/2723>_: When will a Python 3.12 wheel be available?
    • Fixed 2730 <https://github.com/pymupdf/PyMuPDF/issues/2730>_: persistent get_text() formatting
  • Other:

    • Use MuPDF-1.23.4.
    • Fix optimisation flags with system installs.
    • Fixed the problem that the clip parameter does not take effect during table recognition
    • Support Pillow mode "RGBa"
    • Support extra word delimiters
    • Support checking valid PDF name objects

Changes in version 1.23.4 (2023-09-26)

  • Improved build instructions.

  • Fixed Tesseract in rebased implementation.

  • Improvements to build/install with system MuPDF.

  • Fixed Pyodide builds.

  • Fixed rebased bug in _insert_image().

  • Bug fixes:

    • Fixed 2556 <https://github.com/pymupdf/PyMuPDF/issues/2556>_: Segmentation fault at caling get_cdrawings(extended=True)
    • Fixed 2637 <https://github.com/pymupdf/PyMuPDF/issues/2637>_: Page.insert_textbox incorrectly handles the last word if it starts a new line
    • Fixed 2683 <https://github.com/pymupdf/PyMuPDF/issues/2683>_: Windows sdist build failure - non-quoting of path and using UNIX which command
    • Fixed 2691 <https://github.com/pymupdf/PyMuPDF/issues/2691>_: Page.get_textpage_ocr() bug in rebased fitz_new version
    • Fixed 2692 <https://github.com/pymupdf/PyMuPDF/issues/2692>_: Page.get_pixmap(clip=Rect()) bug in rebased fitz_new version

Changes in version 1.23.3 (2023-08-31)

  • Fixed use of Tesseract for OCR.

... (truncated)

Commits
  • 394ab53 pipcl.py: fix finding python-config on macos python-3.12 builds.
  • 585d3e3 .github/workflows/build_wheels.yml: install python-3.12 on windows.
  • 2928529 Fix typos
  • daeb793 Improve Japanese doc for better translations
  • 8a7eba2 .github/workflows/build_wheels.yml: Fix problems building for MacOS on github.
  • 5e83357 Update versions, dates and changelog for 1.23.5.
  • 37a5e59 setup.py: avoid problems caused by name of git clone.
  • b4a1d23 tests/test_2548.py: avoid hang on old mupdf.
  • d0a553e pipcl.py: allow fn_sdist to specify a different name of file within the sdist.
  • 66c51c4 scripts/gh_release.py: allow building with mupdf from git.
  • Additional commits viewable in compare view


Dependabot compatibility score

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


Dependabot commands and options
You can trigger Dependabot actions by commenting on this PR: - `@dependabot rebase` will rebase this PR - `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it - `@dependabot merge` will merge this PR after your CI passes on it - `@dependabot squash and merge` will squash and merge this PR after your CI passes on it - `@dependabot cancel merge` will cancel a previously requested merge and block automerging - `@dependabot reopen` will reopen this PR if it is closed - `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually - `@dependabot show ignore conditions` will show all of the ignore conditions of the specified dependency - `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)
dependabot[bot] commented 8 months ago

Superseded by #554.