Updated PST parser to use standard Message metadata keys and improved
handling of embedded files (TIKA-4248).
Convenience methods for XML readers were moved from ParseContext to
XMLReaderUtils (TIKA-4259).
Other Changes
Add GRPC server (TIKA-4181).
Improved configurability in tika-pipes (TIKA-4243).
Add optional PST parser based on libpst/readpst (TIKA-4250).
Release 3.0.0-BETA - 12/01/2023
BREAKING CHANGES
Require Java 11 (TIKA-4128).
The boilerpipe handler has been moved to the tika-handler-boiler-pipe
package (TIKA-4138).
We've migrated HTML parsing to the JSoup parser instead of TagSoup. If
you have a custom configuration on the HTMLParser, you'll need to change
that to o.a.t.p.html.JSoupParser (TIKA-1599).
Removed xerces2 as a dependency (TIKA-4135).
tika-core now has a scope of "provided" in most non-app modules (TIKA-4191).
Tika will look for "custom-mimetypes.xml" directly on the classpath, NOT
under "/org/apache/tika/mime/". (TIKA-4147).
Return media type "text/javascript" instead of "application/javascript"
to follow RFC-9239. (TIKA-4119).
Other Changes/Updates
Improve detection of sqlite3-based file formats (TIKA-4187).
Upgrade PDFBox to 3.0.1 (TIKA-3347)
Deprecated AbstractParser for removal in 4.x (TIKA-4132).
Fix bug in DateUtils that stripped timezone information from
Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.
Dependabot commands and options
You can trigger Dependabot actions by commenting on this PR:
- `@dependabot rebase` will rebase this PR
- `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it
- `@dependabot merge` will merge this PR after your CI passes on it
- `@dependabot squash and merge` will squash and merge this PR after your CI passes on it
- `@dependabot cancel merge` will cancel a previously requested merge and block automerging
- `@dependabot reopen` will reopen this PR if it is closed
- `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
- `@dependabot show ignore conditions` will show all of the ignore conditions of the specified dependency
- `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
- `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
- `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)
Bumps org.apache.tika:tika-core from 2.8.0 to 2.9.2.
Changelog
Sourced from org.apache.tika:tika-core's changelog.
... (truncated)
Commits
1dbf284
[maven-release-plugin] prepare release 2.9.2-rc27a36751
revert version for rc2a501d0c
Revert writing of all file paths for embedded contents of epub (TIKA-4219)a4dc6b9
[maven-release-plugin] prepare for next development iteration52af992
[maven-release-plugin] prepare release 2.9.2-rc1ddaf3b0
Update CHANGES.txt in prep for 2.9.2 release2d585b8
TIKA-4219 -- clean up...do not include font names in main package88b582f
Merge remote-tracking branch 'origin/branch_2x' into branch_2x800b551
update CHANGES.txtde408df
TIKA-4223 -- add detection of stl (#1691)Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting
@dependabot rebase
.Dependabot commands and options
You can trigger Dependabot actions by commenting on this PR: - `@dependabot rebase` will rebase this PR - `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it - `@dependabot merge` will merge this PR after your CI passes on it - `@dependabot squash and merge` will squash and merge this PR after your CI passes on it - `@dependabot cancel merge` will cancel a previously requested merge and block automerging - `@dependabot reopen` will reopen this PR if it is closed - `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually - `@dependabot show