VIDA-NYU / ache

ACHE is a web crawler for domain-specific search.
http://ache.readthedocs.io
Apache License 2.0
449 stars 135 forks source link

Bump tika-parsers from 1.18 to 2.4.0 #259

Closed dependabot[bot] closed 2 years ago

dependabot[bot] commented 2 years ago

Bumps tika-parsers from 1.18 to 2.4.0.

Changelog

Sourced from tika-parsers's changelog.

Release 2.4.1 - ???

  • Fixed bug in ordering of Parsers during service loading (TIKA-3750).

  • Users can expand system properties from the forking process into forked tika-server processes (TIKA-3748).

Release 2.4.0 - 04/23/2022

  • NOTE: To save on resources, we no longer include the deeplearning4j dependencies in the tika-dl jar. The dependencies for the tika-dl package must be provided by users. See: https://github.com/apache/tika/blob/main/tika-parsers/tika-parsers-ml/tika-dl/pom.xml for the dependencies that must be provided at run-time (TIKA-3676).

  • NOTE: Added prefix "dwg-custom:" to DWG custom metadata properties (TIKA-3731).

  • Add initial, BETA-grade TLS encryption option for tika-server; configuration may change in future releases (TIKA-3719).

  • Allow specification of fetcherName and fetchKey via query parameters in request URI in tika-server (TIKA-3714).

  • Add basic parsers for WARC and WACZ in tika-parsers-standard (TIKA-3697).

  • Add MetadataWriteFilter capability to improve memory profile in Metadata objects (TIKA-3695).

  • Allow configurability of the ContentHandlerDecorator used by the AutoDetectParser (TIKA-3723).

  • Allow configurability of the EmbeddedDocumentExtractor used by the AutoDetectParser (TIKA-3711).

  • Add detection for Frictionless Data packages and WACZ (TIKA-3696).

  • Add detection for DGN files with gratitude and credit to Steven Frew's tika-dgn-detector (TIKA-3721).

  • Add parser for metadata from DGN 8 files via Dan Coldrick (TIKA-3721).

  • Add a fetcher and emitter for Azure blob storage (TIKA-3707).

  • Add detection for files encrypted by Microsoft's Rights Management Service (TIKA-3666).

  • Fixed regression in 2.3.0 that led to more embedded filenames than appropriate being written to the content (TIKA-3711).

  • tika-server now clones forking process' environment variables

... (truncated)

Commits
  • 740f70b [maven-release-plugin] prepare release 2.4.0-rc1
  • 8faf9ae roll back version for second attempt at 2.4.0-rc1
  • fad145d need to switch tasks -> target for major release upgrade for antrun
  • c14de4f Merge remote-tracking branch 'origin/main' into main
  • 7d3911e TIKA-3743 -- install
  • 6e9d5c0 [maven-release-plugin] prepare for next development iteration
  • 3677bf6 [maven-release-plugin] prepare release 2.4.0-rc1
  • d189d7b Update CHANGES.txt for X.Y.Z release.
  • 3b322e9 fix workflow names for branch_1x
  • bbd8edd Merge remote-tracking branch 'origin/main' into main
  • Additional commits viewable in compare view


Dependabot compatibility score

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


Dependabot commands and options
You can trigger Dependabot actions by commenting on this PR: - `@dependabot rebase` will rebase this PR - `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it - `@dependabot merge` will merge this PR after your CI passes on it - `@dependabot squash and merge` will squash and merge this PR after your CI passes on it - `@dependabot cancel merge` will cancel a previously requested merge and block automerging - `@dependabot reopen` will reopen this PR if it is closed - `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually - `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)
dependabot[bot] commented 2 years ago

Superseded by #285.