Add MetadataWriteFilter capability to improve memory profile in
Metadata objects (TIKA-3695).
Allow specification of fetcherName and fetchKey via query parameters
in request URI in tika-server (TIKA-3714).
Add basic parsers for WARC and WACZ in tika-parsers-standard (TIKA-3697).
Add detection for Frictionless Data packages and WACZ (TIKA-3696).
Add a fetcher and emitter for Azure blob storage (TIKA-3707).
Allow configurability of the EmbeddedDocumentExtractor used
by the AutoDetectParser (TIKA-3711).
Fixed regression in 2.3.0 that led to more embedded filenames
than appropriate being written to the content (TIKA-3711).
tika-server now clones the forking process's environment variables
into the forked process (TIKA-3715).
Add an optional /eval endpoint for tika-eval profile or compare
capabilities in tika-server (TIKA-3689).
Add a Parsed-By-Full-Set metadata item to record all parsers that processed
a file (TIKA-3716).
Add metadata filters for Optimaize and OpenNLP language detectors (TIKA-3717).
Upgrade deeplearning4j to 1.0.0-M2 (TIKA-3458 and PR#527).
Various dependency upgrades, including POI, dl4j, gson, jackson,
twelvemonkeys, log4j2 and others (TIKA-3675 and many PRs from dependabot).
Release 2.3.0 - 02/02/2022
Upgrade to Apache POI 5.2.0. This is the first upgrade to POI
5.x and represents a major refactoring. Users may experience
significantly more logging (TIKA-3164).
Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.
Dependabot commands and options
You can trigger Dependabot actions by commenting on this PR:
- `@dependabot rebase` will rebase this PR
- `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it
- `@dependabot merge` will merge this PR after your CI passes on it
- `@dependabot squash and merge` will squash and merge this PR after your CI passes on it
- `@dependabot cancel merge` will cancel a previously requested merge and block automerging
- `@dependabot reopen` will reopen this PR if it is closed
- `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
- `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
- `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
- `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)
Bumps tika-core from 1.16 to 2.3.0.
Changelog
Sourced from tika-core's changelog.
... (truncated)
Commits
- `02e0f62` [maven-release-plugin] prepare release 2.3.0-rc1
- `37c04a4` fix rat:check errors
- `25caa2a` Update CHANGES.txt for 2.3.0 release.
- `243701a` prep for 2.3.0 release process
- `b578212` prep for 2.3.0 release process
- `ffc6c6e` update team list to include emeritus status and correct accent on Bertrand's ...
- `cba7dee` TIKA-3640 -- general upgrades for the next 2.x version -- revert stax-ex
- `3f6fc25` TIKA-3640 -- general upgrades for the next 2.x version
- `ff9e044` Bump testcontainers from 1.16.2 to 1.16.3 (#494)
- `afe946a` Bump slf4j.version from 1.7.33 to 1.7.35 (#493)