DDS-GmbH / jocument

JVM based template engine for DOCX and XLSX.
MIT License

Bump tika-core from 1.26 to 2.1.0 #61

Closed: dependabot[bot] closed this 2 years ago

dependabot[bot] commented 2 years ago

Bumps tika-core from 1.26 to 2.1.0.

Changelog

Sourced from tika-core's changelog.

Release 2.1.1 - ???

  • Improve robustness and features of the httpfetcher (TIKA-3543)

  • Add optional fetch ranges to FetchEmitTuple to allow range fetching from, e.g. http or s3 (TIKA-3542).

  • Exclude dependencies on jsoup and ehcache in ucar grib/cdm (TIKA-3003).

Release 2.1.0 - 08/18/2021

MAJOR CHANGES in 2.1.0:

  • Improved packaging for tika-parsers-extended. Use the tika-parser-scientific-package and tika-parser-sqlite3-package artifacts if you want fat jars with dependencies. (TIKA-3510)

  • Tika app writes UTF-8 when an encoding is not specified; the legacy behavior was UTF-8 on Mac OS, but System default on other OSs (TIKA-3515).

  • Change the default rendering strategy for PDFs from NO_TEXT to ALL (TIKA-3520).

Other changes:

  • Fixed bug that pointed to the wrong tessdata directory if the user specified a tesseract path but not also a tessdata path (TIKA-3518).

  • Fixed bug in Icu4j's encoding detector where it would return non-standard names for charsets, e.g. IBM424_rtl is now returned as IBM424 (TIKA-3516).

  • Add a simple UrlFetcher in tika-core as a basic alternative to tika-fetcher-http (TIKA-3527).

  • Add tika-pipes support for Google Cloud Storage (TIKA-3524).

  • Fix markup ordering errors in xhtml output for ODT files (TIKA-2242).

  • Fix serialization of embedded docs in OpenSearch emitter and fix embedded documents not being indexed in some use cases in the Solr emitter (TIKA-3490).

  • Add pipesClientId system property to PipesServer so that each forked process can log to its own logger (TIKA-3480).

  • Add DateNormalizingMetadataFilter to let users ensure that all dates emitted to Solr/OpenSearch are in UTC. Users can configure which timezone they'd like to use in cases where the file format does not store a timezone (TIKA-3496).

  • Breaking change in the Solr and OpenSearch emitters. To achieve

... (truncated)

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.

Dependabot will merge this PR once CI passes on it, as requested by @AntonOellerer.


Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

  • `@dependabot rebase` will rebase this PR

  • `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it

  • `@dependabot merge` will merge this PR after your CI passes on it

  • `@dependabot squash and merge` will squash and merge this PR after your CI passes on it

  • `@dependabot cancel merge` will cancel a previously requested merge and block automerging

  • `@dependabot reopen` will reopen this PR if it is closed

  • `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually

  • `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)

  • `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)

  • `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

AntonOellerer commented 2 years ago

@dependabot merge

github-actions[bot] commented 2 years ago

Unit Test Results

  2 files ±0    2 suites ±0    4s ⏱ ±0s
  12 tests ±0    12 ✔ ±0    0 💤 ±0    0 ❌ ±0

Results for commit 47e4c8b6. ± Comparison against base commit 47e4c8b6.

♻ This comment has been updated with latest results.